NOVEL ISOFORMS OF HUMAN 
PREGNANCY-ASSOCIATED PROTEIN-E 
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by reference in their entireties. 
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FIELD OF THE INVENTION 

The present invention relates to novel 
isoforms of a human protein, and particularly relates 
to novel isoforms of human pregnancy-associated plasma 
5 protein-E. 

BACKGROUND OF THE INVENTION 

Pregnancy-Associated Plasma Protein-A 
(PAPP-A) was first identified as a component of a 
circulating protein complex uniquely present in the 

10 serum of pregnant women. Principally of placental, 
that is, fetal, origin, and detectable in maternal 
serum as early as 4 weeks into gestation, PAPP-A has 
proven useful as a readily sampled marker for prenatal 
monitoring of fetal health and diagnosis of a number of 

15 human fetal abnormalities. 

Maternal serum PAPP-A levels normally 
increase throughout gestation. Failure of PAPP-A 
levels to increase at the normal rate — that is, PAPP-A 
levels lower than the average for the respective 

20 gestational age — has been associated with a variety of 
fetal disorders. 

For example, PAPP-A levels have been shown to 
be significantly lower than normal at 10 - 14 weeks 
gestation in pregnancies that subsequently result in 

25 miscarriage, pregnancy-induced hypertension, growth 

restriction, and pre-existing or gestational diabetes 
mellitus. Ong et al . , Brit. J. Obstet. Gynaec. 107: 
1265-70 (2000) . 

As another example, a statistical measure of 

30 maternal serum PAPP-A levels (the median multiple of 

the median (MoM) ) is significantly decreased (less than 
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the 5th centile of normal in 78% of cases) during the 
first trimester in cases of fetal trisomy 18. Tul et 
al. f Prenat. Diagn. 19: 1035-42 (1999). When 
measurement of PAPP-A is combined with measurement of 
5 maternal serum free PhCG, fetal nuchal translucehcy, 
and maternal age, 89% of cases of trisomy 18 can be 
detected with a 1% false-positive rate. 

In the second trimester of pregnancy, 
maternal serum levels of PAPP-A are reduced more 

10 markedly than either alpha fetoprotein (aFP) or free 
beta-hCG in cases of trisomy 18; one study reports 
levels of PAPP-A lower than the 5% centile of normal in 
93% of the cases. Spencer et a 1 . , Prenat. Diagn. 19: 
1127-34 (1999) . 

15 Median maternal serum levels of PAPP-A are 

also significantly reduced at 8 to 14 weeks in 
trisomy 21 gestations. Screening using maternal age, 
serum-free (3-hCG, and PAPP-A at 10 weeks of pregnancy 
has been demonstrated to provide better prediction of 

20 fetal trisomy 21 than one standard test (levels of 

alpha- f etoprotein and hCG, in conjuction with maternal 
age) , and equal predictive value to the test of aFP, 
unconjugated estriol, hCG, and maternal age at 15 - 22 
weeks. Wald et al . , Br. J. Obstet. Gynaecol. 

2 5 103 (5) : 4 07-412 (1996) . 

Although first discovered based upon its 
primary expression in placental tissue, PAPP-A has also 
been detected in ovaries, with expression restricted to 
healthy antral follicles in granulosa cells and healthy 

30 corpora lutea (CL) in a subset of large luteal cells, a 
pattern of expression consistent with a role for PAPP-A 
at the very outset of pregnancy, through control of 
survival, growth, and/or differentiation of the 



dominant ovarian follicle. Hourvitz et a J . , J. Clin. 
Endocrinol. Metab. 85:4916-4920 (2000). 

PAPP-A is a member of the metzincin 
superfamily of metalloproteinases . 

The metzincin gene superfamily was first 
identified based upon topological and sequence 
relationships among the astacins, adamalysins, 
serralysins, and matrix metalloproteinases. These zinc 
endopeptidases share topological similarity with 
respect to a five-stranded beta-sheet and three alpha- 
helices arranged in typical sequential order. The 
common consensus motif, HEXXHXXGXXH, found in PAPP-A at 
residues 482-492, contains three histidine residues 
which are involved in binding of the catalytically 
essential zinc ion. Stocker et. a J . , Protein Sci. 
4:823-40 (1995). Metzincins also possess a conserved 
methionine residue in spaced relationship to the zinc- 
binding motif; in PAPP-A, the conserved methionine is 
believed to be the methionine at residue 556. 

PAPP-A has been demonstrated specifically to 
cleave insulin-like growth factor binding protein 4 
(IGFBP-4), Lawrence et a 1 . , Proc. Natl. Acad. Sci. USA 
96:3149-3153 (1999), and to be the dominating, if not 
sole, IGFBP-4 protease present in the circulation, 
Overgaard et al., J. Biol. Chem. 275:41128-31133 
(2000) . IGFBP-4 is one of six known inhibitors of IGF 
action in vitro; like other IGFBPs, cleavage of IGFBP-4 
has been shown to abolish its ability to inhibit IGF 
activity. The cleavage specificity of PAPP-A for 
IGFBP-4 implicates PAPP-A in normal and pathological 
physiology of insulin-like growth factor (IGF) . 
Overgaard et al . , J. Biol. Chem. 275: 31128-33 (2000). 



PAPP-A exists in pregnancy serum as a 
covalent, heterotetrameric 2:2 complex with the proform 
of eosinophil major basic protein (proMBP) ; pro-MBP 
appears to inhibit PAPP-A 1 s protease activity. 
5 Overgaard et a J . , J. Biol. Chem. 275:31128-33 (2000). 
Conversely, IGF appears to be a necessary cofactor or 
agonist for PAPP-A protease activity, leading to a 
feedback network controlling IGF availability. 

Reports in the literature of pregnancy- 

10 related proteins related in sequence to PAPP-A, termed 
PAPP-B, PAPP-C, and PAPP-D, have proven spurious. Farr 
et al., Biochim. Biophys. Acta 1493:356-362 (2000) 
recently identified a cDNA that encodes a protein 
related in primary sequence and protein domain 

15 structure to PAPP-A and that is expressed primarily in 
placenta, which they term PAPP-E . FARR et al . report 
that the cDNA encodes a complete open reading frame. 

Recent reports suggest that at least one- 
third, and likely a higher percentage, of human genes 

20 are alternatively spliced. Hanke et al . , Trends 

Genet. 15(1): 389 - 390 (1999); Mironov et al., Genome 
Res. 9:1288-93 (1999); Brett et al . , FEBS Lett. 
474(1) :83-6 (2000). Alternative splicing has been 
proposed to account for at least part of the difference 

25 between the number of genes recently called from the 
completed human genome draft sequence — 30,000 to 
40,000 (Genome International Sequencing Consortium, 
Nature 409:860-921 (15 February 2001) - and earlier 
predictions of human gene number that routinely ranged 

30 as high as 120,000, Liang et a J . , Nature Genet. 

25 (2) :239-240 (2000). With the Drosophila homolog of 
one human gene reported to have 38,000 potential 
alternatively spliced variants, Schmucker et al . , Cell 
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101:671 (2000), it now appears that alternative 
splicing may permit the relatively small number of 
human coding regions to encode millions, perhaps tens 
of millions, of structurally distinct proteins and 
5 protein isoforms. 

With increasing age, women experience 
decrease in ovarian reserve and, upon conception, an 
increased incidence of aneuploid gestations. Given a 
likely role of PAPP-A in controlling ovarian follicular 

10 maturation, and its proven clinical utility as a 

predictor of fetal abnormality during gestation, PAPP-A 
has potential therapeutic as well as diagnostic roles 
in clinical infertility practice. 

With the recent identification of a protein 

15 that is related to the clinically useful prenatal 

diagnostic marker, human PAPP-A, and the recognition 
that alternatively spliced isoforms of proteins are as 
critical to metabolic and physiologic function as 
proteins that are separately encoded, there is a need 

20 to identify and to characterize additional isoforms of 
the PAPP-E protein. 

SUMMARY OF THE INVENTION 

The present invention solves these and other 
needs in the art by providing isolated nucleic acids 

25 that encode three novel isoforms of hPAPP-E, and 
fragments thereof. 

In other aspects, the invention provides 
vectors for propagating and expressing the nucleic 
acids of the present invention, host cells comprising 

30 the nucleic acids and vectors of the present invention, 
proteins, protein fragments, and protein fusions of the 
novel PAPP-E isoforms, and antibodies thereto. 



The invention further provides pharmaceutical 
formulations of the nucleic acids, proteins, and 
antibodies of the present invention. 

In other aspects, the invention provides 
5 transgenic cells and non-human organisms comprising 
human PAPP-E isoform nucleic acids, and transgenic 
cells and non-human organisms with targeted disruption 
of the endogenous orthologue of the human PAPP-E gene. 

The invention additionally provides 
10 diagnostic, investigational, and therapeutic methods 
based on the PAPP-E nucleic acids, proteins, and 
antibodies of the present invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The above and other objects and advantages of 
15 the present invention will be apparent upon 

consideration of the following detailed description 
taken in conjunction with the accompanying drawings, in 
which like characters refer to like parts throughout, 
and in which: 

20 FIG. 1 schematizes the protein domain 

structure of the three novel isoforms of PAPP-E and the 
earlier-described PAPP-A; 

FIG. 2 is a map showing the genomic structure 
and alternative exon usage of three novel isoforms of 
25 human PAPP-E that are encoded at chromosome lq24.1, 
termed PAPP-Ea, PAPP-Eb, and PAPP-Ec; 

FIG. 3 presents the nucleotide and predicted 
amino acid sequences of PAPP-Ea; 

FIG. 4 presents the nucleotide and predicted 
3 0 amino acid sequences of PAPP-Eb; and 

FIG. 5 presents the nucleotide and predicted 
amino acid sequences of PAPP-Ec. 



DETAILED DESCRIPTION OF THE INVENTION 

Mining the sequence of the human genome for 
novel human genes, the present inventors have 
identified three novel isoforms of the recently cloned 
human pregnancy-associated protein E (PAPP-E) , a 
protein expressed predominantly in placenta and related 
to the clinically useful prenatal diagnostic marker, 
human PAP P - A . 

As schematized in FIG. 1, the newly isolated 
isoforms - PAPP-Ea, PAPP-Eb, and PAPP-Ec - share 
certain protein domains and an overall structural 
organization with PAPP-A; in conjunction with a pattern 
of expression strikingly similar to that of PAPP-A, 
with high level expression in placenta, the shared 
structural features strongly imply that the three 
PAPP-E isoforms play a role similar to that of PAPP-A 
in regulating the activity of a plasma borne growth 
factor (s), likely IGF, which in turn is important for 
maintenance of pregnancy and/or normal fetal 
development, thus making the PAPP-E isoforms clinically 
useful diagnostic markers and potential therapeutic 
agents . 

Like PAPP-A, all three novel isoforms have 
the zinc-binding domain ("zinc") characteristic of 
metzincin superfamily metalloproteases, defined by the 
degenerate motif "HEXXHXXGXXH", where invariant 
residues are shown underlined and variable residues are 
shown as "X". In PAPP-Ea, the longest isoform, the 
zinc binding domain occurs at residues 733 - 743 with 
sequence HE VGHVLGL YH . 

In common with PAPP-A, all three novel 
isoforms of PAPP-E have an at least four-fold 
repetition near the C-terminus of the short consensus 



repeat ("SCR"; alternatively denominated "sushi" 
domain) (relaxation of certain bioinf ormatic parameters 
causes bioinf ormatic algorithms to suggest a potential 
five-fold repetition) . 
5 In common with PAPP-A, all three novel 

isoforms of PAPP-E also have at least one "NL" (notch- 
lin, also termed lin notch repeat, or "LNR") domain, 
so-called due to its presence in Notch and Lin-12 
proteins, both of which proteins regulate early tissue 

10 differentiation. As shown in FIG. 1, PAPP-Ea possesses 
three NL domains in the same general spaced 
relationship to the zinc domain as is found in PAPP-A. 
PAPP-Eb, in contrast, lacks the C-terminal NL domain, 
whereas PAPPE-c, the shortest of the novel isoforms, 

15 lacks the two NL domains on the N-terminal side of the 
zinc-binding domain. 

The four-fold repetition of SCR ("sushi") 
domains is characteristic of complement proteins and 
selectins. Five-fold repetition of SCR domains with 

20 further presence of at least one NL domain has been 

previously identified in complement decay-accelerating 
factor and P-selectin. 

In contrast to PAPP-A, two of the novel 
isoforms of PAPP-E - PAPP-Ea and PAPP-Eb - have a 

25 laminin G domain. Laminin G domains are found in a 

number of extracellular and receptor proteins, and are 
implicated in interactions with cellular receptors 
(integrins, alpha-dystroglycan) , sulfated carbohydrates, 
and other extracellular ligands. 

30 In contrast to PAPP-A, all three novel 

isoforms of PAPP-E contain nuclear localization signals 
( "NLS" ) ; with concurrent presence of a leader sequence 
(not shown) , these signals suggest that all three PAPP- 



E isoforms can be secreted and also localize to the 
cell nucleus. 

FIG. 2 shows the genomic organization of the 
three PAPP-E isoforms. 

At the top is shown the four bacterial 
artificial chromosomes (BACs), with GenBank accession 
numbers, that span the PAPP-E locus. The genome- 
derived single-exon probe first used to demonstrate 
expression from this locus, as further described in 
commonly owned and copending provisional patent 
application no. 60/207,456, filed May 26, 2000, the 
disclosure of which is incorporated herein by reference 
in its entirety, is shown below the BACs and labeled 
"500". The 500 bp probe includes sequence drawn solely 
from exon two. 

As shown in FIG. 2, PAPP-Ea, encoding a 
protein of 1791 amino acids, is the longest PAPP-E 
isoform, comprising exons 1 - 20, 22 and 23. Insertion 
of the 8 5 bp exon 21 in PAPP-Eb leads to a downstream 
frame shift and earlier termination, thus encoding a 
protein of 1770 amino acids. PAPP-Ec lacks exons 2, 3 
and 21, encoding a protein of 1385 amino acids. 
Predicted molecular weights, prior to any post- 
translational modification, are 199 kD, 196 kD and 152 
kD, respectively. 

As further discussed in the examples herein, 
expression of PAPP-E was assessed using hybridization 
to genome-derived single exon microarrays and northern 
blot. Microarray analysis of the first two exons 
showed high level expression in placenta, and little 
expression in other tissues. This was confirmed by 
northern blot of 12 tissues (blood leukocyte, lung, 
placenta, small intestine, liver, kidney, spleen, 
thymus, colon, skeletal muscle, heart and brain) . 
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As more fully described below, the present 
invention provides isolated nucleic acids that encode 
each of the novel isoforms of hPAPP-E, and fragments 
thereof. The invention further provides vectors for 
5 propagation and expression of the nucleic acids of the 
present invention, host cells comprising the nucleic 
acids and vectors of the present invention, proteins, 
protein fragments, and protein fusions of the present 
invention, and antibodies specific for all or any one 

10 of the isoforms. The invention provides pharmaceutical 
formulations of the nucleic acids, proteins, and 
antibodies of the present invention. The invention 
further provides transgenic cells and non-human 
organisms comprising human PAPP-E isoform nucleic 

15 acids, and transgenic cells and non-human organisms 

with targeted disruption of the endogenous orthologue 
of the human PAPP-E gene. The invention additionally 
provides diagnostic, investigational, and therapeutic 
methods based on the PAPP-E nucleic acids, proteins, 

20 and antibodies of the present invention. 

DEFINITIONS 

As used herein, "nucleic acid" includes 
polynucleotides having natural nucleotides in native 
5 1 -3 1 phosphodiester linkage — e.g., DNA or RNA — as 

25 well as polynucleotides that have nonnatural nucleotide 
analogues, nonnative internucleoside bonds, or both, so 
long as the nonnatural polynucleotide is capable of 
sequence-discriminating basepairing under 
experimentally desired conditions. Unless otherwise 

30 specified, the term "nucleic acid" includes any 

topological conformation; the term thus explicitly 
comprehends single-stranded, double-stranded, partially 
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duplexed, triplexed, hairpinned, circular, and 
padlocked conformations. 

As used herein, an "isolated nucleic acid" is 
a nucleic acid molecule that exists in a physical form 
5 that is nonidentical to any nucleic acid molecule of 
identical sequence as found in nature; "isolated" does 
not require, although it does not prohibit, that the 
nucleic acid so described has itself been physically 
removed from its native environment. 

10 For example, a nucleic acid can be said to be 

"isolated" when it includes nucleotides and/or 
internucleoside bonds not found in nature. When 
instead composed of natural nucleosides in 
phosphodiester linkage, a nucleic acid can be said to 

15 be "isolated" when it exists at a purity not found in 
nature, where purity can be adjudged with respect to 
the presence of nucleic acids of other sequence, with 
respect to the presence of proteins, with respect to 
the presence of lipids, or with respect the presence of 

20 any other component of a biological cell, or when the 
nucleic acid lacks sequence that flanks an otherwise 
identical sequence in an organism's genome, or when the 
nucleic acid possesses sequence not identically present 
in nature. 

25 As so defined, "isolated nucleic acid" 

includes nucleic acids integrated into a host cell 
chromosome at a heterologous site, recombinant fusions 
of a native fragment to a heterologous sequence, 
recombinant vectors present as episomes or as 

30 integrated into a host cell chromosome. 

As used herein, an isolated nucleic acid 
"encodes" a reference polypeptide when at least a 
portion of the nucleic acid, or its complement, can be 
directly translated to provide the amino acid sequence 
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of the reference polypeptide, or when the isolated 
nucleic acid can be used, alone or as part of an 
expression vector, to express the reference polypeptide 
in vitro, in a prokaryotic host cell, or in a 
5 eukaryotic host cell. 

As used herein, the term "exon" refers to a 
nucleic acid sequence found in genomic DNA that is 
bioinf ormatically predicted and/or experimentally 
confirmed to contribute contiguous sequence to a mature 

10 mRNA transcript. 

As used herein, the phrase "open reading 
frame" and the equivalent acronym "ORF" refer to that 
portion of a transcript-derived nucleic acid that can 
be translated in its entirety into a sequence of 

15 contiguous amino acids. As so defined, an ORF has 

length, measured in nucleotides, exactly divisible by 
3. As so defined, an ORF need not encode the entirety 
of a natural protein. 

As used herein, the phrase "ORF-encoded 

20 peptide" refers to the predicted or actual translation 
of an ORF. 

As used herein, the phrase "degenerate 
variant" of a reference nucleic acid sequence intends 
all nucleic acid sequences that can be directly 

25 translated, using the standard genetic code, to provide 
an amino acid sequence identical to that translated 
from the reference nucleic acid sequence. 

As used herein, the term "microarray" and 
equivalent phrase "nucleic acid microarray" refer to a 

30 substrate-bound collection of plural nucleic acids, 

hybridization to each of the plurality of bound nucleic 
acids being separately detectable. The substrate can 
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be solid or porous, planar or non-planar, unitary or 
distributed. 

As so defined, the term "microarray" and 
phrase "nucleic acid microarray" include all the 
5 devices so called in Schena (ed.), DNA Microarravs: A 
Practical Approach (Practical Approach Series ) , Oxford 
University Press (1999) (ISBN: 0199637768); Nature 
Genet. 21 (1 ) (suppl) : 1 - 60 (1999); and Schena (ed.), 
Microarray Biochip: Tools and Technology , Eaton 

10 Publishing Company/BioTechniques Books Division (2000) 
(ISBN: 1881299376), the disclosures of which are 
incorporated herein by reference in their entireties. 

As so defined, the term "microarray" and 
phrase "nucleic acid microarray" also include 

15 substrate-bound collections of plural nucleic acids in 
which the plurality of nucleic acids are distributably 
disposed on a plurality of beads, rather than on a 
unitary planar substrate, as is described, inter alia, 
in Brenner et al., Proc. Natl. Acad. Sci . USA 

20 97(4) : 166501670 (2000), the disclosure of which is 

incorporated herein by reference in its entirety; in 
such case, the term "microarray" and phrase "nucleic 
acid microarray" refer to the plurality of beads in 
aggregate . 

25 As used herein with respect to solution phase 

hybridization, the term "probe", or equivalently, 
"nucleic acid probe" or "hybridization probe", refers 
to an isolated nucleic acid of known sequence that is, 
or is intended to be, detectably labeled. As used 

30 herein with respect to a nucleic acid microarray, the 
term "probe" (or equivalently "nucleic acid probe" or 
"hybridization probe") refers to the isolated nucleic 
acid that is, or is intended to be, bound to the 
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substrate. In either such context, the term "target" 
refers to nucleic acid intended to be bound to probe by 
sequence complementarity. 

As used herein, the expression "probe 
5 comprising SEQ ID NO:X", and variants thereof, intends 
a nucleic acid probe, at least a portion of which probe 
has either (i) the sequence directly as given in the 
referenced SEQ ID NO:X, or (ii) a sequence 
complementary to the sequence as given in the 

10 referenced SEQ ID NO:X, the choice as between sequence 
directly as given and complement thereof dictated by 
the requirement that the probe be complementary to the 
desired target. 

As used herein, the phrases "expression of a 

15 probe" and "expression of an isolated nucleic acid" and 
their linguistic equivalents intend that the probe or, 
respectively, the isolated nucleic acid, can hybridize 
detectably under high stringency conditions to a sample 
of nucleic acids that derive from mRNA from a given 

20 source. For example, and by way of illustration only, 
expression of a probe in "liver" means that the probe 
can hybridize detectably under high stringency 
conditions to a sample of nucleic acids that derive 
from mRNA obtained from liver. 

25 As used herein, the terms "protein", 

"polypeptide", and "peptide" are used interchangeably 
to refer to a naturally-occurring or synthetic polymer 
of amino acid monomers (residues) , irrespective of 
length, where amino acid monomer here includes 

30 naturally-occurring amino acids, naturally-occurring 
amino acid structural variants, and synthetic non- 
naturally occurring analogs that are capable of 
participating in peptide bonds. The terms "protein", 
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"polypeptide" , and "peptide" explicitly permits of 
post-translational and post-synthetic modifications, 
such as glycosylation. 

The term "oligopeptide" herein denotes a 
5 protein, polypeptide, or peptide having 25 or fewer 
monomer ic subunits . 

The phrases "isolated protein", "isolated 
polypeptide", "isolated peptide" and "isolated 
oligopeptide" refer to a protein (equally, to a 

10 polypeptide, peptide, or oligopeptide) that is 

nonidentical to any protein molecule of identical amino 
acid sequence as found in nature; "isolated" does not 
require, although it does not prohibit, that the 
protein so described has itself been physically removed 

15 from its native environment. 

For example, a protein can be said to be 
"isolated" when it includes amino acid analogues or 
derivatives not found in nature, or includes linkages 
other than standard peptide bonds. 

20 When instead composed entirely of natural 

amino acids linked by peptide bonds, a protein can be 
said to be "isolated" when it exists at a purity not 
found in nature — where purity can be adjudged with 
respect to the presence of proteins of other sequence, 

25 with respect to the presence of non-protein compounds, 
such as nucleic acids, lipids, or other components of a 
biological cell, or when it exists in a composition not 
found in nature, such as in a host cell that does not 
naturally express that protein. 

30 A "purified protein" (equally, a purified 

polypeptide, peptide, or oligopeptide) is an isolated 
protein, as above described, present at a concentration 
of at least 95%, as measured on a mass basis with 



- 17 - 



respect to total protein in a composition. A 
"substantially purified protein" (equally, a 
substantially purified polypeptide, peptide, or 
oligopeptide) is an isolated protein, as above 
5 described, present at a concentration of at least 70%, 
as measured on a mass basis with respect to total 
protein in a composition. 

As used herein, the phrase "protein isoforms" 
refers to a plurality of proteins having nonidentical 

10 primary amino acid sequence but that share amino acid 
sequence encoded by at least one common exon. 

As used herein, the phrase "alternative 
splicing" and its linguistic equivalents includes all 
types of RNA processing that lead to expression of 

15 plural protein isoforms from a single gene; 

accordingly, the phrase "splice variant (s) " and its 
linguistic equivalents embraces mRNAs transcribed from 
a given gene that, however processed, collectively 
encode plural protein isoforms. For example, and by 

20 way of illustration only, splice variants can include 
exon insertions, exon extensions, exon truncations, 
exon deletions, alternatives in the 5' untranslated 
region ("5 1 UT" ) and alternatives in the 3' 
untranslated region ("3 f UT"). Such 3' alternatives 

25 include, for example, differences in the site of RNA 

transcript cleavage and site of poly (A) addition. See, 
e.g., Gautheret et al . , Genome Res. 8:524-530 (1998). 

As used herein, "orthologues" are separate 
occurrences of the same gene in multiple species. The 

30 separate occurrences have similar, albeit nonidentical, 
amino acid sequences, the degree of sequence similarity 
depending, in part, upon the evolutionary distance of 
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the species from a common ancestor having the same 
gene . 

As used herein, the term "paralogues" 

indicates separate occurrences of a gene in one 
5 species. The separate occurrences have similar, albeit 
nonidentical, amino acid sequences, the degree of 
sequence similarity depending, in part, upon the 
evolutionary distance from the gene duplication event 
giving rise to the separate occurrences, 

10 As used herein, the term "homologues" is 

generic to "orthologues" and "paralogues". 

As used herein, the term "antibody" refers to 
a polypeptide, at least a portion of which is encoded 
by at least one immunoglobulin gene, or fragment 

15 thereof, and that can bind specifically to a desired 

target molecule. The term includes naturally-occurring 
forms, as well as fragments and derivatives. 

Fragments within the scope of the term 
include those produced by digestion with various 

20 proteases, those produced by chemical cleavage and/or 
chemical dissociation, and those produced 
recombinantly, so long as the fragment remains capable 
of specific binding to a target molecule. Among such 
fragments are Fab, Fab f , Fv, F(ab)' 2 , and single chain 

25 Fv (scFv) fragments. 

Derivatives within the scope of the term 
include antibodies (or fragments thereof) that have 
been modified in sequence, but remain capable of 
specific binding to a target molecule, including: 

30 interspecies chimeric and humanized antibodies; 

antibody fusions; heteromeric antibody complexes and 
antibody fusions, such as diabodies (bispecific 
antibodies) , single-chain diabodies, and intrabodies 



(see, e.g. , Marasco (ed.), Intracellular Antibodies: 
Research and Disease Applications , Springer-Verlag New 
York, Inc. (1998) (ISBN: 3540641513), the disclosure of 
which is incorporated herein by reference in its 
entirety) . 

As used herein, antibodies can be produced by 
any known technique, including harvest from cell 
culture of native B lymphocytes, harvest from culture 
of hybridomas, recombinant expression systems, and 
phage display. 

As used herein, "antigen" refers to a ligand 
that can be bound by an antibody; an antigen need not 
itself be immunogenic. The portions of the antigen 
that make contact with the antibody are denominated 
"epitopes" . 

"Specific binding" refers to the ability of 
two molecular species concurrently present in a 
heterogeneous (inhomogeneous) sample to bind to one 
another in preference to binding to other molecular 
species in the sample. Typically, a specific binding 
interaction will discriminate over adventitious binding 
interactions in the reaction by at least two-fold, more 
typically by at least 10-fold, often at least 100-fold; 
when used to detect analyte, specific binding is 
sufficiently discriminatory when determinative of the 
presence of the analyte in a heterogeneous 
(inhomogeneous) sample. Typically, the affinity or 
avidity of a specific binding reaction is least about 
10" 7 M, with specific binding reactions of greater 
specificity typically having affinity or avidity of at 
least 10~ 8 M to at least about 10~ 9 M. 

As used herein, "molecular binding 
partners" - and equivalent ly, "specific binding 
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partners" — refer to pairs of molecules/ typically 
pairs of biomolecules, that exhibit specific binding. 
Nonlimiting examples are receptor and ligand, antibody 
and antigen, and biotin to any of avidin, streptavidin, 
5 neutrAvidin and captAvidin. 

NUCLEIC ACID MOLECULES 

In a first aspect, the invention provides 
isolated nucleic acids that encode three novel isoforms 
of the PAPP-E protein, variants having at least 90% 

10 sequence identity thereto, degenerate variants thereof, 
variants that encode PAPP-E proteins having 
conservative or moderately conservative substitutions, 
cross-hybridizing nucleic acids, and fragments thereof, 
FIGS. 3, 4, and 5 present the nucleotide 

15 sequences of PAPP-Ea, PAPP-Eb, and PAPP-Ec cDNA clones, 
with predicted amino acid translations; the nucleotide 
sequences are further presented, respectively, in SEQ 
ID NOs:l (full length nucleotide sequence of PAPP-Ea 
cDNA) , 3 (full length amino acid coding sequence of 

20 PAPP-Ea), 8 (nucleotide sequence encoding the entirety 
of PAPP-Eb), 10 (full length amino acid coding sequence 
of PAPP-Eb) , 15 (nucleotide sequence encoding the 
entirety of PAPP-Ec), and 16 (full length amino acid 
coding sequence of PAPP-Ec) . 

25 Unless otherwise indicated, each nucleotide 

sequence is set forth herein as a sequence of 
deoxyribonucleotides . It is intended, however, that 
the given sequence be interpreted as would be 
appropriate to the polynucleotide composition: for 

30 example, if the isolated nucleic acid is composed of 
RNA, the given sequence intends ribonucleotides, with 
uridine substituted for thymidine. 
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Unless otherwise indicated, nucleotide 
sequences of the isolated nucleic acids of the present 
invention were determined by sequencing a DNA molecule 
that had resulted, directly or indirectly, from at 
5 least one enzymatic polymerization reaction (e.g., 

reverse transcription and/or polymerase chain reaction) 
using an automated sequencer (such as the 
MegaBACE™ 1000, Molecular Dynamics, Sunnyvale, CA, 
USA) , or by reliance upon such sequence or upon genomic 

10 sequence prior-accessioned into a public database. 

Unless otherwise indicated, all amino acid sequences of 
the polypeptides of the present invention were 
predicted by translation from the nucleic acid 
sequences so determined. 

15 As a consequence, any nucleic acid sequence 

presented herein may contain errors introduced by 
erroneous incorporation of nucleotides during 
polymerization, by erroroneous base calling by the 
automated sequencer (although such sequencing errors 

20 have been minimized for the nucleic acids directly 

determined herein, unless otherwise indicated, by the 
sequencing of each of the complementary strands of a 
duplex DNA) , or by similar errors accessioned into the 
public database. 

25 Accordingly, each of PAPP-Ea, PAPP-Eb, and 

PAPP-Ec cDNA clones described herein has been deposited 
in a public repository (American Type Culture 
Collection, Manassas, Virginia, USA) under accession 
numbers (PAPP-Ea), (PAPP-Eb), 

30 (PAPP-Ec) . Any errors in sequence reported 

herein can be determined and corrected by sequencing 
nucleic acids propagated from the deposited clones 
using standard techniques. 
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Single nucleotide polymorphisms (SNPs) occur 
frequently in eukaryotic genomes - more than 1.4 
million SNPs have already identified in the human 
genome, International Human Genome Sequencing 
5 Consortium, Nature 409:860 - 921 (2001) - and the 

sequence determined from one individual of a species 
may differ from other allelic forms present within the 
population. Additionally, small deletions and 
insertions, rather than single nucleotide 
10 polymorphisms, are not uncommon in the general 

population, and often do not alter the function of the 
protein. 

Accordingly, it is an aspect of the present 
invention to provide nucleic acids not only identical 

15 in sequence to those described with particularity 

herein, but also to provide isolated nucleic acids at 
least about 90% identical in sequence to those 
described with particularity herein, typically at least 
about 91%, 92%, 93%, 94%, or 95% identical in sequence 

20 to those decribed with particularity herein, usefully 
at least about 96%, 97%, 98%, or 99% identical in 
sequence to those described with particularity herein, 
and, most conservatively, at least about 99*5%, 99.6%, 
99.7%, 99.8% and 99.9% identical in sequence to those 

25 described with particularity herein. These sequence 

variants can be naturally occurring or can result from 
human intervention, as by random or directed 
mutagenesis . 

For purposes herein, percent identity of two 

30 nucleic acid sequences is determined using the 

procedure of Tatiana et al . , "Blast 2 sequences - a new 
tool for comparing protein and nucleotide sequences", 
FEMS Microbiol Lett. 174:247-250 (1999), which 



procedure is effectuated by the computer program BLAST 
2 SEQUENCES, available online at 



http : //www . ncbi . nlm. nih . gov/blast/bl2seq/bl2 . html . 

To assess percent identity of nucleic acids, the BLASTN 
module of BLAST 2 SEQUENCES is used with default values 
of (i) reward for a match: 1; (ii) penalty for a 
mismatch: -2; (iii) open gap 5 and extension gap 2 
penalties; (iv) gap X_dropoff 50 expect 10 word size 11 
filter, and both sequences are entered in their 
entireties. 

As is well known, the genetic code is 
degenerate, with each amino acid except methionine 
translated from a plurality of codons, thus permitting 
a plurality of nucleic acids of disparate sequence to 
encode the identical protein. As is also well known, 
codon choice for optimal expression varies from species 
to species. The isolated nucleic acids of the present 
invention being useful for expression of PAPP-E 
proteins and protein fragments, it is, therefore, 
another aspect of the present invention to provide 
isolated isolated nucleic acids that encode PAPP-E 
isoforms, and portions thereof, not only identical in 
sequence to those described with particularity herein, 
but degenerate variants thereof as well. 

As is also well known, amino acid 
substitutions occur frequently among natural allelic 
variants, with conservative substitutions often 
occasioning only de minimis change in protein function. 

Accordingly, it is an aspect of the present 
invention to provide nucleic acids not only identical 
in sequence to those described with particularity 
herein, but also to provide isolated nucleic acids that 
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encode PAPP-E isoforms, and portions thereof, having 
conservative amino acid substitutions, and also to 
provide isolated nucleic acids that encode PAPP-E 
isoforms, and portions thereof, having moderately 
5 conservative amino acid substitutions. 

Although there are a variety of metrics for 
calling conservative amino acid substitutions, based 
primarily on either observed changes . among 
evolutionarily related proteins or on predicted 
10 chemical similarity, for purposes herein a conservative 
replacement is any change having a positive value in 
the PAM250 log-likelihood matrix reproduced herein 
below (see Gonnet et al . , Science 256 (5062) : 1443-5 
(1992) ) : 
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V 0 -2 -2 -3 0 -2 -2-3-2 3 2-2 2 0 -2 -1 0-3-1 3 

For purposes herein, a "moderately conservative" 
replacement is any change having a nonnegative value in 
the PAM250 log-likelihood matrix reproduced herein 
5 above . 

As is also well known in the art, relatedness 
of nucleic acids can also be characterized using a 
functional test, the ability of the two nucleic acids 
to base-pair to one another at defined hybridization 

10 stringencies. 

It is, therefore, another aspect of the 
invention to provide isolated nucleic acids not only 
identical in sequence to those described with 
particularity herein, but also to provide isolated 

15 nucleic acids ("cross-hybridizing nucleic acids") that 
hybridize under high stringency conditions (as defined 
herein below) to all or to a portion of various of the 
isolated PAPP-E nucleic acids of the present invention 
("reference nucleic acids"), as well as cross- 

20 hybridizing nucleic acids that hybridize under moderate 
stringency conditions to all or to a portion of various 
of the isolated PAPP-E nucleic acids of the present 
invention. 

Such cross-hybridizing nucleic acids are 
25 useful, inter alia, as probes for, and to drive 

expression of, proteins related to the proteins of the 
present invention as alternative i so forms , homologues, 
paralogues, and orthologues. Particularly preferred 
orthologues are those from other primate species, such 
30 as chimpanzee, rhesus macaque, baboon, and gorilla, 

from rodents, such as rats, mice, guinea pigs, and from 
livestock, such as cow, pig, sheep, horse, goat. 
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For purposes herein, high stringency 
conditions are defined as aqueous hybridization (i.e., 
free of formamide) in 6X SSC (where 2 OX SSC contains 
3,0 M NaCl and 0.3 M sodium citrate), 1% SDS at 65°C for 
5 at least 8 hours, followed by one or more washes in 
0.2X SSC, 0.1% SDS at 65°C. For purposes herein, 
moderate stringency conditions are defined as aqueous 
hybridization (i.e., free of formamide) in 6X SSC, 1% 
SDS at 65°C for at least 8 hours, followed by one or 

10 more washes in 2x SSC, 0.1% SDS at room temperature. 

The hybridizing portion of the reference 
nucleic acid is typically at least 15 nucleotides in 
length, often at least 17 nucleotides in length. 
Often, however, the hybridizing portion of the 

15 reference nucleic acid is at least 20 nucleotides in 
length, 25 nucleotides in length, and even 30 
nucleotides, 35 nucleotides, 40 nucleotides, and 50 
nucleotides in length. Of course, cross-hybridizing 
nucleic acids that hybridize to a larger portion of the 

20 reference nucleic acid - for example, to a portion of 
at least 50 nt, at least 100 nt, at least 150 nt, 200 
nt, 250 nt, 300 nt, 350 nt, 400 nt, 450 nt, or 500 nt 
or more - or even to the entire length of the reference 
nucleic acid, are also useful. 

25 The hybridizing portion of the cross- 

hybridizing nucleic acid is at least 75% identical in 
sequence to at least a portion of the reference nucleic 
acid. Typically, the hybridizing portion of the cross- 
hybridizing nucleic acid is at least 80%, often at 

30 least 85%, 86%, 87%, 88%, 89% or even at least 90% 
identical in sequence to at least a portion of the 
reference nucleic acid. Often, the hybridizing portion 
of the cross-hybridizing nucleic acid will be at least 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% 
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identical in sequence to at least a portion of the 
reference nucleic acid sequence. At times, the 
hybridizing portion of the cross-hybridizing nucleic 
acid will be at least 99,5% identical in sequence to at 
5 least a portion of the reference nucleic acid. 

The invention also provides fragments of 
various of the isolated nucleic acids of the present 
invention. 

By "fragments" of a reference nucleic acid is 

10 here intended isolated nucleic acids, however obtained, 
that have a nucleotide sequence identical to a portion 
of the reference nucleic acid sequence, which portion 
is at least 17 nucleotides and less than the entirety 
of the reference nucleic acid. As so defined, 

15 "fragments" need not be obtained by physical 

fragmentation of the reference nucleic acid, although 
such provenance is not thereby precluded. 

In theory, an oligonucleotide of 17 
nucleotides is of sufficient length as to occur at 

20 random less frequently than once in the three gigabase 
human genome, and thus to provide a nucleic acid probe 
that can uniquely identify the reference sequence in a 
nucleic acid mixture of genomic complexity. As is well 
known, further specificity can be obtained by probing 

25 nucleic acid samples of subgenomic complexity, and/or 

by using plural fragments as short as 17 nucleotides in 
length collectively to prime amplification of nucleic 
acids, as, e.g., by polymerase chain reaction (PCR) . 

As further described herein below, nucleic 

30 acid fragments that encode at least 6 contiguous amino 
. acids (i.e., fragments of 18 nucleotides or more) are 
useful in directing the expression or the synthesis of 
peptides that have utility in mapping the epitopes of 
the protein encoded by the reference nucleic acid. 
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See, e.g., Geysen et al., "Use of peptide synthesis to 
probe viral antigens for epitopes to a resolution of a 
single amino acid," Proc. Natl. Acad. Sci. USA 81:3998- 
4002 (1984); and U.S. Pat. Nos. 4,708,871 and 
5 5,595,915, the disclosures of which are incorporated 
herein by reference in their entireties. 

As further described herein below, fragments 
that encode at least 8 contiguous amino acids (i.e., 
fragments of 24 nucleotides or more) are useful in 

10 directing the expression or the synthesis of peptides 
that have utility as immunogens . See, e.g., Lerner, 
"Tapping the immunological repertoire to produce 
antibodies of predetermined specificity, " Nature 
299:592-596 (1982); Shinnick et al . , "Synthetic peptide 

15 immunogens as vaccines," Annu. Rev. Microbiol. 

37:425-46 (1983); Sutcliffe et al . , "Antibodies that 
react with predetermined sites on proteins, " Science 
219:660-6 (1983), the disclosures of which are 
incorporated herein by reference in their entireties. 

20 The nucleic acid fragment of the present 

invention is thus at least 17 nucleotides in length, 
typically at least 18 nucleotides in length, and often 
at least 24 nucleotides in length. Often, the nucleic 
acid of the present invention is at least 25 

25 nucleotides in length, and even 30 nucleotides, 35 
nucleotides, 40 nucleotides, or 45 nucleotides in 
length. Of course, larger fragments having at least 
50 nt, at least 100 nt, at least 150 nt, .200 nt, 250 
nt, 300 nt, 350 nt, 400 nt, 450 nt, or 500 nt or more 

30 are also useful, and at times preferred. 

Having been based upon the mining of genomic 
sequence, rather than upon surveillance of expressed 
message, the present invention further provides 
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isolated genome-derived nucleic acids that include 
portions of the PAPP-E gene. 

The invention particularly provides genome- 
derived single exon probes. 
5 As further described in commonly owned and 

copending U.S. patent application serial nos . 
09/774,203, filed January 29, 2001 and 09/632,366, 
filed August 3, 2000, and provisional U.S. patent 
application nos. 60/236,359, filed May 26, 2000 and 

10 60/236,359, filed September 27, 2000, the disclosures 
of which are incorporated herein by reference in their 
entireties, single exon probes comprise a portion of no 
more than one exon of the reference gene; the exonic 
portion is of sufficient length to hybridize under high 

15 stringency conditions to transcript-derived nucleic 

acids — such as mRNA or cDNA — that contain the exon or 
a portion thereof. 

Genome-derived single exon probes typically 
further comprise, contiguous to a first end of the exon 

20 portion, a first intronic and/or intergenic sequence 
that is identically contiguous to the exon in the 
genome. Often, the genome-derived single exon probe 
further comprises, contiguous to a second end of the 
exonic portion, a second intronic and/or intergenic 

25 sequence that is identically contiguous to the exon in 
the genome . 

The minimum length of genome-derived single 
exon probes is defined by the requirement that the 
exonic portion be of sufficient length to hybridize 
30 under high stringency conditions to transcript-derived 
nucleic acids. Accordingly, the exon portion is at 
least 17 nucleotides, typically at least 18 
nucleotides, 20 nucleotides, 24 nucleotides, 25 
nucleotides or even 30, 35, 40, 45, or 50 nucleotides 
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in length, and can usefully include the entirety of the 
exon, up to 100 nt, 150 nt, 200 nt, 250 nt, 300 nt, 350 
nt, 400 nt or even 500 nt or more in length. 

The maximum length of genome-derived single 
5 exon probes is defined by the requirement that the 
probes contain portions of. no more than one exon. 
Given variable spacing of exons through eukaryotic 
genomes, the maximum length is typically no more than 
25 kb, often no more than 20 kb, 15 kb, 10 kb or 7.5 

10 kb, or even no more than 5 kb, 4 kb, 3 kb, or even no 
more than about 2.5 kb in length. 

Genome-derived single exon probes can 
usefully include at least a first terminal priming 
sequence not found in contiguity with the rest of the 

15 probe sequence in the genome, and often will contain a 
second terminal priming sequence not found in 
contiguity with the rest of the probe sequence in the 
genome . 

The present invention also provides isolated 
20 genome-derived nucleic acids that include nucleic acid 
sequence elements that control transcription of the 
PAPP-E gene and its various isoforms. 

The isolated nucleic acids of the present 
invention can be composed of natural nucleotides in 
25 native 5 f -3' phosphodiester internucleoside linkage — 
e.g., DNA or RNA — or can contain any or all of 
nonnatural nucleotide analogues, nonnative 
internucleoside bonds, or post-synthesis modifications, 
either throughout the length of the nucleic acid or 
30 localized to one or more portions thereof. As is well 
known in the art, when the isolated nucleic acid is 
used as a hybridization probe, the range of such 
nonnatural analogues, nonnative internucleoside bonds, 
or post-synthesis modifications will be limited to 
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those that permit sequence-discriminating basepairing 
of the resulting nucleic acid. When used to direct 
expression or RNA or protein in vitro or in vivo, the 
range of such nonnatural analogues, nonnative 
5 internucleoside bonds, or post-synthesis modifications 
will be limited to those that permit the nucleic acid 
to function properly as a polymerization substrate. 
When the isolated nucleic acid is used as a therapeutic 
agent, the range of such changes will be limited to 

10 those that do not confer toxicity upon the isolated 
nucleic acid. 

For example, when desired to be used as 
probes, the isolated nucleic acids of the present 
invention can usefully include nucleotide analogues 

15 that incorporate labels that are directly detectable, 
such as radiolabels or f luorophores, or nucleotide 
analogues that incorporate labels that can be 
visualized in a subsequent reaction, such as biotin or 
various haptens. 

2 0 Common radiolabeled analogues include those 

labeled with 33 P, 32 P, and 35 S, such as a- 32 P-dATP, a- 32 P- 
dCTP, a- 32 P-dGTP, a- 32 P-dTTP, a- 32 P-3 T dATP, a- 32 P-ATP, a- 
32 P-CTP, a- 32 P-GTP, a- 32 P-UTP, a- 35 S-dATP, y~ 35 S-GTP, y 
33P-dATP, and the like. 

25 Commercially available fluorescent nucleotide 

analogues readily incorporated into the nucleic acids 
of the present invention include Cy3-dCTP, Cy3-dUTP, 
Cy5-dCTP, Cy3-dUTP (Amersham Pharmacia Biotech, 
Piscataway, New Jersey, USA), f luorescein-12-dUTP, 

30 tetramethylrhodamine-6-dUTP, Texas Red®-5-dUTP, Cascade 
Blue®-7-dUTP, BODIPY® FL-14-dUTP, BODIPY® TMR-14-dUTP, 
BODIPY® TR-14-dUTP, Rhodamine Green™-5-dUTP, Oregon 
Green® 488-5-dUTP, Texas Red®-12-dUTP, BODIPY® 
630/650-14-dUTP, BODIPY® 650/665-14-dUTP, Alexa Fluor® 
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488-5-dUTP, Alexa Fluor® 532-5-dUTP, Alexa Fluor® 
568-5-dUTP, Alexa Fluor® 594-5-dUTP, Alexa Fluor® 
54 6-14-dUTP, f luorescein-12-UTP, 

tetramethylrhodamine-6-UTP, Texas Red®-5-UTP, Cascade 
5 Blue®-7-UTP, BODIPY® FL-14-UTP, BODIPY® TMR-14-UTP, 

BODIPY® TR-14-UTP, Rhodamine Green™-5-UTP, Alexa Fluor® 
488-5-UTP, Alexa Fluor® 546-14-UTP (Molecular Probes, 
Inc. Eugene, OR, USA) . 

Protocols are available for custom synthesis 

10 of nucleotides having other f luorophores . Henegariu et 
al., "Custom Fluorecent-Nucleotide Synthesis as an 
Alternative Method for Nucleic Acid Labeling, " Nature 
Biotechnol. 18:345 - 348 (2000), the disclosure of 
which is incorporated herein by reference in its 

15 entirety. 

Haptens that are commonly conjugated to 
nucleotides for subsequent labeling include biotin 
(biotin-ll-dUTP, Molecular Probes, Inc., Eugene, OR, 
USA; biotin-21-UTP, biot in-2 1-dUTP, Clontech 

20 Laboratories, Inc., Palo Alto, CA, USA), digoxigenin 
(DIG-ll-dUTP, alkali labile, DIG-11-UTP, Roche 
Diagnostics Corp., Indianapolis, IN, USA), and 
dinitrophenyl (dinitrophenyl-ll-dUTP, Molecular Probes, 
Inc., Eugene, OR, USA). 

25 As another example, when desired to be used 

for antisense inhibition of translation, the isolated 
nucleic acids of the present invention can usefully 
include altered, often nuclease-resistant, 
internucleoside bonds. See Hartmann et ai . (eds.), 

30 Manual of Antisense Methodology (Perspectives in 

Antisense Science), Kluwer Law International (1999) 
(ISBN:079238539X) ; Stein et al . (eds.), Applied 
Antisense Oligonucleotide Technology , Wiley-Liss (cover 
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(1998) (ISBN: 0471172790); Chadwick et al . (eds.), 
Oligonucleotides as Therapeutic Agents - Symposium No. 
209 , John Wiley & Son Ltd (1997) (ISBN: 0471972797), or 
for targeted gene correction, Gamper et a 1 . , Nucl. 
5 Acids Res. 28 (21 ): 4332-9 (2000), the disclosures of 
which are incorporated herein by reference in their 
entireties . 

Modified oligonucleotide backbones often 
preferred when the nucleic acid is to be used for 

10 antisense purposes are, for example, phosphorothioates, 
chiral phosphorothioates, phosphorodithioates, 
phosphotriesters, aminoalkylphosphotriesters, methyl 
and other alkyl phosphonates including 3 T -alkylene 
phosphonates and chiral phosphonates, phosphinates, 

15 phosphoramidates including 3 1 -amino phosphoramidate and 
aminoalkylphosphoramidates, thionophosphoramidates, 
thionoalkylphosphonates, thionoalkylphosphotriesters, 
and boranophosphates having normal 3' -5' linkages, 
2 f -5' linked analogs of these, and those having 

20 inverted polarity wherein the adjacent pairs of 

nucleoside units are linked 3 f -5' to 5' -3' or 2 f -5' to 
5' -2'. Representative U.S. patents that teach the 
preparation of the above phosphorus-containing linkages 
include, but are not limited to, U.S. Pat. Nos. 

25 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 
5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 
5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 
5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 
5,550,111; 5,563,253; 5,571,799; 5,587,361; and 

30 5,625,050, the disclosures of which are incorporated 
herein by reference in their entireties. 

Preferred modified oligonucleotide backbones 
for antisense use that do not include a phosphorus atom 
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have backbones that are formed by short chain alkyl or 
cycloalkyl internucleoside linkages, mixed heteroatom 
and alkyl or cycloalkyl internucleoside linkages, or 
one or more short chain heteroatomic or heterocyclic 
5 internucleoside linkages. These include those having 
morpholino linkages (formed in part from the sugar 
portion of a nucleoside) ; siloxane backbones; sulfide, 
sulfoxide and sulfone backbones; formacetyl and 
thiof ormacetyl backbones; methylene formacetyl and 
10 thiof ormacetyl backbones; alkene containing backbones; 
sulfamate backbones; methyleneimino and 

methylenehydrazino backbones; sulfonate and sulfonamide 

backbones; amide backbones; and others having mixed N, 

O, S and CH 2 component parts. Representative U.S. 
15 patents that teach the preparation of the above 

backbones include, but are not limited to, U.S. Pat. 

Nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 

5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 

5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 
20 5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 

5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 

5,633,360; 5,677,437; and 5,677,439, the disclosures of 

which are incorporated herein by reference in their 

entireties . 

25 In other preferred oligonucleotide mimetics, 

both the sugar and the internucleoside linkage are 
replaced with novel groups, such as peptide nucleic 
acids (PNA) . 

In PNA compounds, the phosphodiester backbone 

30 of the nucleic acid is replaced with an amide- 
containing backbone, in particular by repeating 
N- (2-aminoethyl) glycine units linked by amide bonds. 
Nucleobases are bound directly or indirectly to aza 



- 35 - 



nitrogen atoms of the amide portion of the backbone, 
typically by methylene carbonyl linkages. 

The uncharged nature of the PNA backbone 
provides PNA/DNA and PNA/RNA duplexes with a higher 
5 thermal stability than is found in DNA/DNA and DNA/RNA 
duplexes, resulting from the lack of charge repulsion 
between the PNA and DNA or RNA strand. In general, the 
Tm of a PNA/DNA or PNA/RNA duplex is 1°C higher per 
base pair than the Tm of the corresponding DNA/DNA or 

10 DNA/RNA duplex (in 100 mM NaCl) . 

The neutral backbone also allows PNA to form 
stable DNA duplexes largely independent of salt 
concentration. At low ionic strength, PNA can be 
hybridized to a target sequence at temperatures that 

15 make DNA hybridization problematic or impossible. And 
unlike DNA/DNA duplex formation, PNA hybridization is 
possible in the absence of magnesium. Adjusting the 
ionic strength, therefore, is useful if competing DNA 
or RNA is present in the sample, or if the nucleic acid 

20 being probed contains a high level of secondary 
structure . 

PNA also demonstrates greater specificity in 
binding to complementary DNA. A PNA/DNA mismatch is 
more destabilizing than DNA/DNA mismatch. A single 

2 5 mismatch in mixed a PNA/DNA 15-mer lowers the Tm by 
8-20 'C (15 'C on average). In the corresponding 
DNA/DNA duplexes, a single mismatch lowers the Tm by 
4-16 *C (11 'C on average) . Because PNA probes can be 
significantly shorter than DNA probes, their 

30 specificity is greater. 

Additionally, nucleases and proteases do not 
recognize the PNA polyamide backbone with nucleobase 
sidechains. As a result, PNA oligomers are resistant 
to degradation by enzymes, and the lifetime of these 



- 36 - 



compounds is extended both in vivo and in vitro. In 
addition, PNA is stable over a wide pH range. 

Because its backbone is formed from amide 
bonds, PNA can be synthesized using a modified peptide 
5 synthesis protocol. PNA oligomers can be synthesized 
by both Fmoc and tBoc methods. Representative U.S. 
patents that teach the preparation of PNA compounds 
include, but are not limited to, U.S. Pat. Nos. 
5,539,082; 5,714,331; and 5,719,262, each of which is 

10 herein incorporated by reference; automated PNA 
synthesis is readily achievable on commercial 
synthesizers (see, e.g., "PNA User's Guide," Rev. 2, 
February 1998, Perseptive Biosystems Part No. 60138, 
Applied Biosystems, Inc., Foster City, CA) . 

15 PNA chemistry and applications are reviewed, 

inter alia, in Ray et al . , FASEB J. 14 ( 9) : 1041-60 
(2000); Nielsen et al . , Pharmacol Toxicol. 86(1) : 3-7 
(2000); Larsen et al . , Biochim Biophys Acta. 
1489 (1) : 159-66 (1999); Nielsen, Curr. Opin. Struct. 

20 Biol. 9(3):353-7 (1999), and Nielsen, Curr. Opin. 

Biotechnol. 10(l):71-5 (1999), the disclosures of which 
are incorporated herein by reference in their 
entireties . 

Differences from nucleic acid compositions 
25 found in nature — e.g., nonnative bases, altered 

internucleoside linkages, post-synthesis modification — 
can be present throughout the length of the nucleic 
acid or can, instead, usefully be localized to discrete 
portions thereof. As an example of the latter, 
30 chimeric nucleic acids can be synthesized that have 

discrete DNA and RNA domains and demonstrated utility 
for targeted gene repair, as further described in U.S. 
Pat. Nos. 5,760,012 and 5,731,181, the disclosures of 
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which are incorporated herein by reference in their 
entireties. As another example, chimeric nucleic acids 
comprising both DNA and PNA have been demonstrated to 
have utility in modified PCR reactions. See Misra et 
5 al., Biochem. 37: 1917-1925 (1998); see also Finn et 
al. t Nucl. Acids Res. 24: 3357-3363 (1996), 
incorporated herein by reference. 

Unless otherwise specified, nucleic acids of 
the present invention can include any topological 
10 conformation appropriate to the desired use; the term 
thus explicitly comprehends, among others, single- 
stranded, double-stranded, triplexed, quadruplexed, 

partially double-stranded, partially-triplexed, 

i 

partially-quadruplexed, branched, hairpinned, circular, 

15 and padlocked conformations. Padlock conformations and 
their utility are further described in Baner et al., 
Curr. Opin. Blotechnol. 12:11-15 (2001); Escude et al . , 
Proc. Natl. Acad. Sci . USA 14; 96 ( 19) : 10603-7 (1999); 
Nilsson et al., Science 265 (5181 ): 2085-8 (1994), the 

20 disclosures of which are incorporated herein by 

reference in their entireties. Triplex and quadruplex 
conformations, and their utility, are reviewed in 
Praseuth et al., Biochim. Biophys. Acta. 
.1489 (1) : 181-206 (1999); Fox, Curr. Med. Chem. 

25 7(1): 17-37 (2000); Kochetkova et al . , Methods Mol . 
Biol. 130:189-201 (2000); Chan et al . , J. Mol. Med. 
75(4):267-82 (1997), the disclosures of which are 
incorporated herein by reference in their entireties. 

The nucleic acids of the present invention 

30 can be detectably labeled. Commonly-used labels 

include radionuclides, such as 32 P, 33 P, 35 S, 3 H (and for 
nmr detection, 13 C and 15 N) , haptens that can be detected 
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by specific antibody or high affinity binding partner 
(such as avidin) , and f luorophores . 

As noted above, detectable labels can be 
incorporated by inclusion of labeled nucleotide 
5 analogues in the nucleic acid. Such analogues can be 
incorporated by enzymatic polymerization, such as by 
nick translation, random priming, polymerase chain 
reaction (PCR) , terminal transferase tailing, and end- 
filling of overhangs, for DNA molecules, and in vitro 

10 transcription driven, e.g., from phage promoters, such 
as T7, T3, and SP6, for RNA molecules. Commercial kits 
are readily available for each such labeling approach. 

Analogues can also be incorporated during 
automated solid phase chemical synthesis. 

15 As is well known, labels can also be 

incorporated after nucleic acid synthesis, with the 5 1 
phosphate and 3' hydroxyl providing convenient sites 
for post-synthetic covalent attachment of detectable 
labels . 

20 Various other post-synthetic approaches 

permit internal labeling of nucleic acids. 

For example, fluorophores can be attached 
using a cisplatin reagent that reacts with the N7 of 
guanine residues (and, to a lesser extent, adenine 

25 bases) in DNA, RNA, and PNA to provide a stable 

coordination complex between the nucleic acid and 
fluorophore label (Universal Linkage System) (available 
from Molecular Probes, Inc., Eugene, OR, USA and 
Amersham Pharmacia Biotech, Piscataway, NJ, USA) ; see 

30 Alers et al., Genes, Chromosomes & Cancer, Vol. 25, pp. 
301 - 305 (1999); Jelsma et al., J. NIH Res. 5:82 
(1994); Van Belkum et al . , BioTechniques 16:148-153 
(1994), incorporated herein by reference. As another 
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example, nucleic acids can be labeled using a 
disulf ide-containing linker (FastTag™ Reagent, Vector 
Laboratories, Inc., Burlingame, CA, USA) that is photo- 
or thermally coupled to the target nucleic acid using 
.5 aryl azide chemistry; after reduction, a free thiol is 
available for coupling to a hapten, fluorophore, sugar, 
affinity ligand, or other marker. 

Multiple independent or interacting labels 
can be incorporated into the nucleic acids of the 

10 present invention. For example, both a fluorophore and 
a moiety that in proximity thereto acts to quench 
fluorescence can be included to report specific 
hybridization through release of fluorescence 
quenching, Tyagi et al., Nature Biotechnol . 14: 303-308 

15 (1996); Tyagi et al . , Nature Biotechnol. 16, 49-53 

(1998); Sokol et al., Proc. Natl. Acad. Sci . USA 95: 
11538-11543 (1998); Kostrikis et al., Science 
279:1228-1229 (1998); Marras et al . , Genet. Anal. 14: 
151-156 (1999); U.S. Pat. Nos . 5,846,726, 5,925,517, 

20 5925517, or to report exonucleotidic excision, U.S. 

Pat. No. 5,538,848; Holland et al . , Proc. Natl. Acad. 
Sci. USA 88:7276-7280 (1991); Heid et a J . , Genome Res. 
6(10): 986-94 (1996); Kuimelis et al., Nucleic Acids »" 
Symp Ser. (37):255-6 (1997); U.S. Patent No. 5,723,591, 

25 the disclosures of which are incorporated herein by 
reference in their entireties. 

So labeled, the isolated nucleic acids of the 
present invention can be used as probes, as further 
described below. 

30 Nucleic acids of the present invention can 

also usefully be bound to a substrate. The substrate 
can porous or solid, planar or non-planar, unitary or 
distributed; the bond can be covalent or noncovalent. 
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Bound to a substrate, nucleic acids of the present 
invention can be used as probes in their unlabeled 
state . 

For example, the nucleic acids of the present 
5 invention can usefully be bound to a porous substrate, 
commonly a membrane, typically comprising 
nitrocellulose, nylon, or positively-charged 
derivatized nylon; so attached, the nucleic acids of 
the present invention can be used to detect PAPP-E 

10 nucleic acids present within a labeled nucleic acid 

sample, either a sample of genomic nucleic acids or a 
sample of transcript-derived nucleic acids, e.g. by 
reverse dot blot. 

The nucleic acids of the present invention 

15 can also usefully be bound to a solid substrate, such 
as glass, although other solid materials, such as 
amorphous silicon, crystalline silicon, or plastics, 
can also be used- Such plastics include 
polymethylacrylic, polyethylene, polypropylene, 

2 0 polyacrylate, polymethylmethacrylate, 

polyvinylchloride, polytetraf luoroethylene, 
polystyrene, polycarbonate, polyacetal, polysulfone, 
celluloseacetate, cellulosenitrate, nitrocellulose, or 
mixtures thereof. 

25 Typically, the solid substrate will be 

rectangular, although other shapes, particularly disks 
and even spheres, present certain advantages. 
Particularly advantageous alternatives to glass slides 
as support substrates for array of nucleic acids are 

30 optical discs, as described in Demers, "Spatially 

Addressable Combinatorial Chemical Arrays in CD-ROM 
Format," international patent publication WO 98/12559, 
incorporated herein by reference in its entirety. 
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The nucleic acids of the present invention 
can be attached covalently to a surface of the support 
substrate or applied to a derivatized surface in a 
chaotropic agent that facilitates denaturation and 
5 adherence by presumed noncovalent interactions, or some 
combination thereof . 

The nucleic acids of the present invention 
can be bound to a substrate to which a plurality of 
other nucleic acids are concurrently bound, 

10 hybridization to each of the plurality of bound nucleic 
acids being separately detectable. At low density, 
e.g. on a porous membrane, these substrate-bound 
collections are typically denominated macroarrays; at 
higher density, typically on a solid support, such as 

15 glass, these substrate bound collections of plural 

nucleic acids are colloquially termed microarrays . As 
used herein, the term microarray includes arrays of all 
densities. It is, therefore, another aspect of the 
invention to provide microarrays that include the 

20 nucleic acids of the present invention. 

The isolated nucleic acids of the present 
invention can be used as hybridization probes to 
detect, characterize, and quantify PAPP-E nucleic acids 
in, and isolate PAPP-E nucleic acids from, both genomic 

25 and transcript-derived nucleic acid samples. When free 
in solution, such probes are typically, but not 
invariably, detectably labeled; bound to a substrate, 
as in a microarray, such probes are typically, but not 
invariably unlabeled. 

30 For example, the isolated nucleic acids of 

the present invention can be used as probes to detect 
and characterize gross alterations in the PAPP-E 
genomic locus, such as deletions, insertions, 
translocations, and duplications of the PAPP-E genomic 
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locus through fluorescence in situ hybridization (FISH) 
to chromosome spreads. See, e.g., Andreeff et al. 
( eds . ) , Introduction to Fluorescence In Situ 
Hybridization: Principles and Clinical Applications , 
5 John Wiley & Sons. (1999) (ISBN: 0471013455), the 

disclosure of which is incorporated herein by reference 
in its entirety. The isolated nucleic acids of the 
present invention can be used as probes to assess 
smaller genomic alterations using, e.g., Southern blot 

10 detection of restriction fragment length polymorphisms. 
The isolated nucleic acids of the present invention can 
be used as probes to isolate genomic clones that 
include the nucleic acids of the present invention, 
which thereafter can be restriction mapped and 

15 sequenced to identify deletions, insertions, 

translocations, and substitutions (single nucleotide 
polymorphisms, SNPs) at the sequence level. 

The isolated nucleic acids of the present 
invention can be also be used as probes to detect, 

20 characterize, and quantify PAPP-E nucleic acids in, and 
isolate PAPP-E nucleic acids from, transcript-derived 
nucleic acid samples. 

For example, the isolated nucleic acids of 
the present invention can be used as hybridization 

25 probes to detect, characterize by length, and quantify 
PAPP-E mRNA by northern blot of total or poly-A+- 
selected RNA samples. For example, the isolated 
nucleic acids of the present invention can be used as 
hybridization probes to detect, characterize by 

30 location, and quantify PAPP-E message by in situ 
hybridization to tissue sections (see, e.g., 
Schwarchzacher et al., In Situ Hybridization , 
Springer-Verlag New York (2000) (ISBN: 0387915966), the 



- 43 - 



disclosure of which is incorporated herein by reference 
in its entirety) . For example, the isolated nucleic 
acids of the present invention can be used as 
hybridization probes to measure the representation of 
5 PAPP-E clones in a cDNA library. For example, the 

isolated nucleic acids of the present invention can be 
used as hybridization probes to isolate PAPP-E nucleic 
acids from cDNA libraries, permitting sequence level 
characterization of PAPP-E messages, including 

10 identification of deletions, insertions, truncations — 
including deletions, insertions, and truncations of 
exons in alternatively spliced forms — and single 
nucleotide polymorphisms. 

All of the aforementioned probe techniques 

15 are well within the skill in the art, and are described 
at greater length in standard texts such as Sambrook et 
aJ., Molecular Cloning: A Laboratory Manual (3 rd ed.), 
Cold Spring Harbor Laboratory Press (2001) (ISBN: 
0879695773); Ausubel et a J . (eds.), Short Protocols in 

20 Molecular Biology: A Compendium of Methods from Current 
Protocols in Molecular Biology (4 th ed.), John Wiley & 
Sons, 1999 (ISBN: 047132938X) ; and Walker et al. 
(eds.), The Nucleic Acids Protocols Handbook , Humana 
Press (2000) (ISBN: 0896034593), the disclosures of 

25 which are incorporated herein by reference in their 
entirety. 

As described in the Examples herein below, 
the nucleic acids of the present invention can also be 
used to detect and quantify PAPP-E nucleic acids in 
30 transcript-derived samples — that is, to measure 

expression of the PAPP-E gene — when included in a 
microarray. Measurement of placental PAPP-E expression 
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has particular utility in prenatal diagnosis, as 
further described in the Examples herein below. 

As would be readily apparent to one of skill 
in the art, each PAPP-E nucleic acid probe — whether 
5 labeled, substrate-bound, or both — is thus currently 
available for use as a tool for measuring the level of 
PAPP-E expression in each of the tissues in which 
expression has already been confirmed, notably 
placenta. The utility is specific to the probe: under 

10 high stringency conditions, the probe reports the level 
of expression of message specifically containing that 
portion of the PAPP-E gene included within the probe. 

Measuring tools are well known in many arts, 
not just in molecular biology, and are known to possess 

15 credible, specific, and substantial utility. For 

example, U.S. Patent No. 6,016,191 describes and claims 
a tool for measuring characteristics of fluid flow in a 
hydrocarbon well; U.S. Patent No. 6,042,549 describes 
and claims a device for measuring exercise intensity; 

20 U.S. Patent No. 5,889,351 describes and claims a device 
for measuring viscosity and for measuring 
characteristics of a fluid; U.S. Patent No. 5,570,694 
describes and claims a device for measuring blood 
pressure; U.S. Patent. No. 5,930,143 describes and 

25 claims a device for measuring the dimensions of machine 
tools; U.S. Patent No. 5,279,044 describes and claims a 
measuring device for determining an absolute position 
of a movable element; U.S. Patent No. 5,186,042 
describes and claims a device for measuring action 

30 force of a wheel; and U.S. Patent No. 4,246,774 

describes and claims a device for measuring the draft 
of smoking articles such as cigarettes. 

As for tissues not yet demonstrated to 
express PAPP-E, the PAPP-E nucleic acid probes of the 
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present invention are currently available as tools for 
surveying such tissues to detect the presence of PAPP-E 
nucleic acids . 

Survey tools — i.e., tools for determining 
5 the presence and/or location of a desired object by 
search of an area — are well known in many arts, not 
just in molecular biology, and are known to possess 
credible, specific, and substantial utility. For 
example, U.S. Patent No. 6,046,800 describes and claims 

10 a device for surveying an area for objects that move; 
U.S. Patent No. 6,025,201 describes and claims an 
apparatus for locating and discriminating platelets 
from non-platelet particles or cells on a cell-by-cell 
basis in a whole blood sample; U.S. Patent No, 

15 5,990,689 describes and claims a device for detecting 
and locating anomalies in the electromagnetic 
protection of a system; U.S. Patent No. 5,984,175 
describes and claims a device for detecting and 
identifying wearable user identification units; U.S. 

20 Patent No. 3,980,986 ("Oil well survey tool"), 

describes and claims a tool for finding the position of 
a drill bit working at the bottom of a borehole. 

As noted above, the nucleic acid probes of 
the present invention are useful in constructing 

25 microarrays; the microarrays, in turn, are products of 
manufacture that are useful for measuring and for 
surveying gene expression. 

When included on a microarray, each PAPP-E 
nucleic acid probe makes the microarray specifically 

30 useful for detecting that portion of the PAPP-E gene 
included within the probe, thus imparting upon the 
microarray device the ability to detect a signal where, 
absent such probe, it would have reported no signal. 
This utility makes each individual probe on such 
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microarray akin to an antenna, circuit, firmware or 
software element included in an electronic apparatus, 
where the antenna, circuit, firmware or software 
element imparts upon the apparatus the ability newly 
5 and additionally to detect signal in a portion of the 
radio-frequency spectrum where previously it could not; 
such devices are known to have specific, substantial, 
and credible utility. 

Changes in expression need not be observed 

10 for the measurement of expression to have utility. 

For example, where gene expression analysis 
is used to assess toxicity of chemical agents on cells, 
the failure of the agent to change a gene's expression 
level is evidence that the drug likely does not affect 

15 the pathway of which the gene's expressed protein is a 
part. Analogously, where gene expression analysis is 
used to assess side effects of pharmacologic 
agents — whether in lead compound discovery or in 
subsequent screening of lead compound derivatives — the 

20 inability of the agent to alter a gene's expression 
level is evidence that the drug does not affect the 
pathway of which the gene's expressed protein is a 
part . 

WO 99/58720, incorporated herein by reference 
2 5 in its entirety, provides methods for quantifying, the 
relatedness of a first and second gene expression 
profile and for ordering the relatedness of a plurality 
of gene expression profiles, without regard to the 
identity or function of the genes whose expression is 
30 used in the calculation. 

Gene expression analysis, including gene 
expression analysis by microarray hybridization, is, of 
course, principally a laboratory-based art. Devices 
and apparatus used principally in laboratories to 
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facilitate laboratory research are well-established to 
possess specific, substantial, and credible utility. 
For example, U.S. Patent No. 6,001,233 describes and 
claims a gel electrophoresis apparatus having a cam- 
5 activated clamp; for example, U.S. Patent No. 6,051,831 
describes and claims a high mass detector for use in 
time-of -flight mass spectrometers; for example, U.S. 
Patent NO. 5,824,269 describes and claims a flow 
cytometer— few gel electrophoresis apparatuses, TOF-MS 

10 devices, or flow cytometers are sold for consumer use. 

Indeed, and in particular, nucleic acid 
microarrays, as devices intended for laboratory use in 
measuring gene expression, are well-established to have 
specific, substantial and credible utility. Thus, the 

15 microarrays of the present invention have at least the 
specific, substantial and credible utilities of the 
microarrays claimed as devices and articles of 
manufacture in the following U.S. patents, the 
disclosures of each of which is incorporated herein by 

20 reference: U.S. Patent Nos. 5,445,934 ("Array of 

oligonucleotides on a solid substrate"); 5,744,305 
("Arrays of materials attached to a substrate"); and 
6,004,752 ("Solid support with attached molecules"). 

Genome-derived single exon probes and genome- 

25 derived single exon probe microarrays have the 

additional utility, inter alia, of permitting high- 
throughput detection of splice variants of the nucleic 
acids of the present invention, as further described in 
copending and commonly owned U.S. Patent application 

30 no. 09/632,366, filed August 3, 2000, the disclosure of 
which is incorporated herein by reference in its 
entirety. 

The isolated nucleic acids of the present 
invention can also be used to prime synthesis of 
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nucleic acid, for purpose of either analysis or 
isolation, using mRNA, cDNA, or genomic DNA as 
template . 

For use as primers, at least 17 contiguous 
5 nucleotides of the isolated nucleic acids of the 

present invention will be used. Often, at least 18, 
19, or 20 contiguous nucleotides of the nucleic acids 
of the present invention will be used, and on occasion 
at least 20, 22, 24, or 25 contiguous nucleotides of 

10 the nucleic acids of the present invention will be 

used, and even 30 nucleotides or more of the nucleic 
acids of the present invention can be used to prime 
specific synthesis . 

The nucleic acid primers of the present 

15 invention can be used, for example, to prime first 
strand cDNA synthesis on an mRNA template* 

Such primer extension can be done directly to 
analyze the message. Alternatively, synthesis on an 
mRNA template can be done to produce first strand cDNA. 

20 The first strand cDNA can thereafter be used, inter 
alia, directly as a single-stranded probe, as above- 
described, as a template for sequencing — permitting 
identification of alterations, including deletions, 
insertions, and substitutions, both normal allelic 

25 variants and mutations associated with abnormal 

phenotypes— or as a template, either for second strand 
cDNA synthesis (e.g., as an antecedent to insertion 
into a cloning or expression vector) , or for 
amplification. 

30 The nucleic acid primers of the present 

invention can also be used, for example, to prime 
single base extension (SBE) for SNP detection (see, 
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e.g., U.S. Pat. No. 6,004,744, the disclosure of which 
is incorporated herein by reference in its entirety) . 

As another example, the nucleic acid primers 
of the present invention can be used to prime 
5 amplification of PAPP-E nucleic acids, using 
transcript-derived or genomic DNA as template. 

Primer-directed amplification methods are now 
well-established in the art. Methods for performing 
the polymerase chain reaction (PCR) are compiled, inter 

10 alia, in McPherson, PCR (Basics: From Background to 
Bench) , Springer Verlag (2000) (ISBN: 0387916008); 
Innis et al . (eds.), PCR Applications: Protocols for 
Functional Genomics , Academic Press (1999) (ISBN: 
0123721857); Gelfand et al . (eds.), PCR Strategies , 

15 Academic Press (1998) (ISBN: 0123721822); Newton et 
al./ PCR, Springer-Verlag New York (1997) (ISBN: 
0387915060); Burke (ed.), PCR: Essential Techniques , 
John Wiley & Son Ltd (1996) (ISBN: 047195697X) ; White 
(ed.), PCR Cloning Protocols : From Molecular Cloning to 

20 Genetic Engineering, Vol. 67, Humana Press (1996) 

(ISBN: 0896033430); McPherson et al . (eds.), PCR 2: A 
Practical Approach , Oxford University Press, Inc. 
(1995) (ISBN: 0199634254), the disclosures of which are 
incorporated herein by reference in their entireties. 

25 Methods for performing RT-PCR are collected, e.g., in 
Siebert et al . (eds.), Gene Cloning and Analysis by 
RT-PCR / Eaton Publishing Company/Bio Techniques Books 
Division, 1998 (ISBN: 1881299147); Siebert (ed.)/ PCR 
Technique : RT-PCR , Eaton Publishing 

30 Company/BioTechniques Books (1995) (ISBN: 1881299139) , 
the disclosure of which is incorporated herein by 
reference in its entirety. 
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Isothermal amplification approaches, such as 
rolling circle amplif ication, are also now well- 
described. See, e.g., Schweitzer et al . , Curr. Opin. 
Biotechnol. 12(l):21-7 (2001); U.S. Patent Nos. 
5 5,854,033 and 5,714,320 and international patent 
publications WO 97/19193 and WO 00/15779, the 
disclosures of which are incorporated herein by 
reference in their entireties. Rolling circle 
amplification can be combined with other techniques to 

10 facilitate SNP detection. See, e.g., Lizardi et a J . , 
Nature Genet. 19(3):225-32 (1998). 

As further described below, nucleic acids of 
the present invention, inserted into vectors that flank 
the nucleic acid insert with a phage promoter, such as 

15 T7, T3, or SP6 promoter, can be used to drive in vitro 
expression of RNA complementary to either strand of the 
nucleic acid of the present invention. The RNA can be 
used, inter alia, as a single-stranded probe, to effect 
subtraction, or for in vitro translation. 

20 As will be further discussed herein below, 

nucleic acids of the present invention that encode 
PAPP-E protein or portions thereof can be used, inter 
alia, to express the PAPP-E proteins or protein 
fragments, either alone, or as part of fusion proteins. 

25 Expression can be from genomic nucleic acids 

of the present invention, or from transcript-derived 
nucleic acids of the present invention. 

Where protein expression is effected from 
genomic DNA, expression will typically be effected in 

30 eukaryotic, typically mammalian, cells capable of 
splicing introns from the initial RNA transcript. 
Expression can be driven from episomal vectors, such as 
EBV-based vectors, or can be effected from genomic DNA 
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integrated into a host cell chromosome. As will be 
more fully described below, where expression is from 
transcript-derived (or otherwise intron-less) nucleic 
acids of the present invention, expression can be 
5 effected in wide variety of prokaryotic or eukaryotic 
cells . 

Expressed in vitro, the protein, protein 
fragment, or protein fusion can thereafter be isolated, 
to be used, inter alia, as a standard in immunoassays 

10 specific for the proteins, or protein isoforms, of the 
present invention; to be used as a therapeutic agent, 
e.g., to be administered as passive replacement therapy 
in individuals deficient in the proteins of the present 
invention, or to be administered as a vaccine; to be 

15 used for in vitro production of specific antibody, the 
antibody thereafter to be used, e.g., as an analytical 
reagent for detection and quantitation of the proteins 
of the present invention or to be used as an 
immunotherapeutic agent . 

20 The isolated nucleic acids of the present 

invention can also be used to drive in vivo expression 
of the proteins of the present invention. In vivo 
expression can be driven from a vector — typically a 
viral vector, often a vector based upon a replication 

2 5 incompetent retrovirus, an adenovirus, or an adeno- 

associated virus (AAV) - for purpose of gene therapy. 
In vivo expression can also be driven from signals 
endogenous to the nucleic acid or from a vector, often 
a plasmid vector, for purpose of "naked" nucleic acid 

30 vaccination, as further described in U.S. Pat. Nos. 

5,589,466; 5,679,647; 5,804,566; 5,830,877; 5,843,913; 
. 5,880,104; 5,958,891; 5,985,847; 6,017,897; 6,110,898; 
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6,204,250, the disclosures of which are incorporated 
herein by reference in their entireties. 

The nucleic acids of the present invention 
can also be used for antisense inhibition of 
5 translation. See Phillips (ed. ) , Antisense Technology/ 
Part B , Methods in Enzymology Vol. 314, Academic Press, 
Inc. (1999) (ISBN: 012182215X) ; Phillips (ed.), 
Antisense Technology, Part A , Methods in Enzymology 
Vol. 313, Academic Press, Inc. (1999) (ISBN: 

10 0121822141); Hartmann et al. (eds.), Manual of 

Antisense Methodology (Perspectives in Antisense 
Science), Kluwer Law International (1999) 
(ISBN:079238539X) ; Stein et al . (eds.), Applied 
Antisense Oligonucleotide Technology , Wiley-Liss (cover 

15 (1998) (ISBN: 0471172790); Agrawal et al . (eds.), 

Antisense Research and Application , Springer-Verlag New 
York, Inc. (1998) (ISBN: 3540638334); Lichtenstein et 
aim (eds.), Antisense Technology: A Practical Approach , 
Vol. 185, Oxford University Press, INC. (1998) (ISBN: 

20 0199635838); Gibson (ed.), Antisense and Ribozvme 
Methodology: Laboratory Companion , Chapman & Hall 
(1997) (ISBN: 3826100794); Chadwick et al . (eds.), 
Oligonucleotides as Therapeutic Agents - Symposium No. 
209 , John Wiley & Son Ltd (1997) (ISBN: 0471972797), 

25 the disclosures of which are incorporated herein by 
reference in their entireties. 

Nucleic acids of the present invention that 
encode full-length human PAPP-E protein isoforms, 
particularly cDNAs encoding full-length isoforms, have 

30 additional, well-recognized, utility as products of 
manufacture suitable for sale. 

For example, cDNAs encoding full length human 
proteins have immediate, real world utility as 
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commercial products suitable for sale. Invitrogen 
Corp. (Carlsbad, CA, USA) , through its Research 
Genetics subsidiary, sells full length human cDNAs 
cloned into one of a selection of expression vectors as 
5 GeneStorm® expression-ready clones; utility is specific 
for the gene, since each gene is capable of being 
ordered separately and has a distinct catalogue number, 
and utility is substantial, each clone selling for 
$650.00 US. 

10 Nucleic acids of the present invention that 

include genomic regions encoding the human PAPP-E 
protein isoforms, or portions thereof, have yet further 
utilities . 

For example, genomic nucleic acids of the 

15 present invention can be used as amplification 

substrates, e.g. for preparation of genome-derived 
single exon probes of the present invention, described 
above, and further described in copending and commonly- 
owned U.S. patent application nos. 09/774,203, filed 

20 January 29, 2001, and 09/632,366, filed August 3, 2000 
and commonly-owned and copending U.S. provisional 
patent application nos. 60/207,456, filed May 26, 2000, 
60/234,687, filed September 21, 2000, 60/236,359, filed 
27 September 2000, the disclosures of which are 

25 incorporated herein by reference in their entireties. 

As another example, genomic nucleic acids of 
the present invention can be integrated non- 
homologously into the genome of somatic cells, e.g. CHO 
cells, COS cells, or 293 cells, with or without 

30 amplification of the insertional locus, in order, e.g., 
to create stable cell lines capable of producing the 
proteins of the present invention. 
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As another example, more fully described 
herein below, genomic nucleic acids of the present 
invention can be integrated nonhomologously into 
embryonic stem (ES) cells to create transgenic non- 
5 human animals capable of producing the proteins of the 
present invention. 

Genomic nucleic acids of the present 
invention can also be used to target homologous 
recombination to the human PAPP-E locus. See, e.g., 

10 U.S. Patent Nos. 6,187,305; 6,204,061; 5,631,153; 

5,627,059; 5,487,992; 5,464,764; 5,614,396; 5,527,695 
and 6,063,630; and Kmiec et al . (eds.), Gene Targeting 
Protocols , Vol. 133, Humana Press (2000) (ISBN: 
0896033600); Joyner (ed. ) , Gene Targeting: A Practical 

15 Approach , Oxford University Press, Inc. (2000) (ISBN: 
0199637938); Sedivy et al . , Gene Targeting , Oxford 
University Press (1998) (ISBN: 071677013X) ; Tymms et 
al. (eds.), Gene Knockout Protocols , Humana Press 
(2000) (ISBN: 0896035727); Mak et al. (eds.), The Gene 

20 Knockout FactsBook . Vol. 2, Academic Press, Inc. (1998) 
(ISBN: 0124660444); Torres et al . , Laboratory Protocols 
for Conditional Gene Targeting , Oxford University Press 
(1997) (ISBN: 019963677X) ; Vega (ed.), Gene Targeting , 
CRC Press, LLC (1994) (ISBN: 084938950X) , the 

25 disclosures of which are incorporated herein by 
reference in their entireties. 

Where the genomic region includes 
transcription regulatory elements, homologous 
recombination can be used to alter the expression of 

30 PAPP-E, both for purpose of in vitro production of 
PAPP-E protein from human cells, and for purpose of 
gene therapy. See, e.g., U.S. Pat. Nos. 5,981,214, 
6,048,524; 5,272,071. 
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Fragments of the nucleic acids of the present 
invention smaller than those typically used for 
homologous recombination can also be used for targeted 
gene correction or alteration, possibly by cellular 
5 mechanisms different from those engaged during 
homologous recombination . 

For example, partially duplexed RNA/DNA 
chimeras have been shown to have utility in targeted 
gene correction, U.S. Pat. Nos. 5,945,339, 5,888,983, 

10 5,871,984, 5,795,972, 5,780,296, 5,760,012, 5,756,325, 
5,731,181, the disclosures of which are incorporated 
herein by reference in their entireties. So too have 
small oligonucleotides fused to triplexing domains have 
been shown to have utility in targeted gene correction, 

15 Culver et a J . , "Correction of chromosomal point 
mutations in human cells with bifunctional 
oligonucleotides," Nature Biotechnol. 17 ( 10) : 989-93 

(1999) , as have oligonucleotides having modified 
terminal bases or modified terminal internucleoside 

20 bonds, Gamper et a 1 . , Nucl . Acids Res. 28 (21 ): 4332-9 

(2000) , the disclosures of which are incorporated 
herein by reference. 

Nucleic acids of the present invention can be 
obtained by using the labeled probes of the present 

25 invention to probe nucleic acid samples, such as 

genomic libraries, cDNA libraries, and mRNA samples, by 
standard techniques. Nucleic acids of the present 
invention can also be obtained by amplification, using 
the nucleic acid primers of the present invention, as 

30 further demonstrated in Example 1, herein below. 

Nucleic acids of the present invention of fewer than 
about 100 nt can also be synthesized chemically, 



typically by solid phase synthesis using commercially 
available automated synthesizers. 



"Full Length" PAPP-E Isoform Nucleic Acids 

In a first series of nucleic acid 
embodiments, the invention provides isolated nucleic 
acids that encode the entirety of a PAPP-E protein 
isoform. As discussed above, the "full-length" nucleic 
acids of the present invention can be used, inter alia, 
to express full length PAPP-Ea, PAPP-Eb, and PAPP-Ec 
isoforms. The full-length nucleic acids can also be 
used as nucleic acid probes; used as probes, the 
isolated nucleic acids of these embodiments will 
hybridize to all known isoforms of PAPP-E without 
discriminating thereamong. 

In a first such embodiment, the invention 
provides an isolated nucleic acid comprising (i) the 
nucleotide sequence of the nucleic acid of ATCC deposit 

/ (ii) the nucleotide sequence of SEQ ID N0:1, 

or (iii) the complement of (i) or (ii) . The ATCC 
deposit has, and SEQ ID N0:1 presents, the entire cDNA 
of PAPP-Ea, including the 5 T untranslated (UT) region 
and 3 ' UT . 

In a second embodiment, the invention 
provides an isolated nucleic acid comprising (i) the 
nucleotide sequence of SEQ ID N0:2, (ii) a degenerate 
variant of the nucleotide sequence of SEQ ID NO: 2, or 
(iii) the complement (i) or (ii) . SEQ ID NO:2 presents 
the open reading frame from SEQ ID NO:l. 

In a third embodiment, the invention provides 
an isolated nucleic acid comprising (i) a nucleotide 
sequence that encodes a polypeptide with the amino acid 
sequence of SEQ ID NO: 3 or (ii) the complement of a 
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nucleotide sequence that encodes a polypeptide with the 
amino acid sequence of SEQ ID NO: 3. SEQ ID NO: 3 
provides the 1791 amino acid sequence of PAPP-Ea . 

In a fourth embodiment, the invention 
5 provides an isolated nucleic acid having a nucleotide 
sequence that (i) encodes a polypeptide having the 
sequence of SEQ ID NO: 3, (ii) encodes a polypeptide 
having the sequence of SEQ ID NO: 3 with conservative 
amino acid substitutions, or (iii) that is the 

10 complement of (i) or (ii) , where SEQ ID NO:3 provides 
the 1791 amino acid sequence of PAPP-Ea. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising (i) the nucleotide 
sequence of the nucleic acid of ATCC deposit , 

15 (ii) the nucleotide sequence of SEQ ID NO: 8 or (iii) 
the complement of (i) or (ii) , where the referenced 
ATCC deposit has, and SEQ ID NO: 8 provides, the 
nucleotide sequence of the entire PAPP-Eb ORF and 
portions of the 3 f UT. 

20 In another embodiment, the invention provides 

an isolated nucleic acid comprising (i) the nucleotide 
sequence of SEQ ID NO: 9, (ii) a degenerate variant of 
the nucleotide sequence of SEQ ID NO: 9, or (iii) the 
complement (i) or (ii), where SEQ ID NO: 9 presents the 

25 nucleotide seuqence of the open reading frame coding 
region of PAPP-Eb cDNA. 

In a further embodiment, the invention 
provides an isolated nucleic acid comprising (i) a 
nucleotide sequence that encodes a polypeptide with the 
.30 amino acid sequence of SEQ ID NO: 10 or (ii) the 

complement of a nucleotide sequence that encodes a 
polypeptide with the amino acid sequence of SEQ ID 
NO: 10, where SEQ ID NO: 10 provides the full length 
amino acid coding sequence of PAPP-Eb. The invention 
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further provides an isolated nucleic acid comprising a 
nucleotide sequence that (i) encodes a polypeptide 
having the sequence of SEQ ID NO: 10, (ii) encodes a 
polypeptide having the sequence of SEQ ID NO: 10 with 
5 conservative amino acid substitutions, or (iii) that is 
the complement of (i) or (ii) . 

The invention also provides isolated nucleic 
acids that encode the entirety of the PAPP-Ec isoform. 

In a first such embodiment, the invention 

10 provides an isolated nucleic acid comprising (i) the 

nucleotide sequence of the nucleic acid of ATCC deposit 

/ (ii) the nucleotide sequence of SEQ ID NO: 15, 

(iii) a degenerate variant of SEQ ID NO: 15, or (iv) the 
complement of (i) , (ii) or (iii), where the referenced 

15 deposit has, and SEQ ID NO: 15 provides, the nucleotide 
sequence of the PAPP-Ec open reading frame. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising (i) a nucleotide 
sequence that encodes a polypeptide with the amino acid 

20 sequence of SEQ ID NO: 16 or (ii) the complement of a 

nucleotide sequence that encodes a polypeptide with the 
amino acid sequence of SEQ ID NO: 16, SEQ ID NO: 16 
provides the entire amino acid sequence of PAPP-Ec. 
The invention further provides an isolated nucleic acid 

25 comprising a nucleotide sequence that (i) encodes a 

polypeptide having the sequence of SEQ ID NO: 16, (ii) 
encodes a polypeptide having the sequence of SEQ ID 
NO: 16 with conservative amino acid substitutions, or 
(iii) that is the complement of (i) or (ii) . 

30 Selected Partial Nucleic Acids 

In a second series of nucleic acid 
embodiments, the invention provides isolated nucleic 
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acids that encode select portions of one or more PAPP-E 
protein isoforms. As will be further discussed herein 
below, these "partial" nucleic acids can be used, inter 
alia, to express specific portions of the PAPP-E 
5 protein isoforms — both those portions that are shared 
by two or more isoforms, and those other portions that 
are unique to one or another of the isoforms in 
vitro and in vivo. These "partial" nucleic acids can 
also be used, inter alia, as nucleic probes; used as 

10 probes, various of these embodiments are able to 

discriminate among the individual PAPP-E isoforms. 

In a first such embodiment, the invention 
provides isolated nucleic acids comprising (i) the 
nucleotide sequence of SEQ ID NO: 4 or (ii) the 

15 complement of the nucleotide sequence of SEQ ID NO: 4, 

wherein the isolated nucleic acid is no more than about 
100 kb in length, typically no more than about 75 kb in 
length, more typically no more than about 50 kb in 
length. SEQ ID NO: 4 is the nucleotide sequence, drawn 

20 from both 5 1 UT and initial coding region, of the PAPP- 
Ea cDNA clone that is absent from the clone encoding 
the PAPP-Ef isoform. Often, the isolated nucleic acids 
of this embodiment are no more than about 25 kb in 
length, often no more than about 15 kb in length, and 

25 frequently no more than about 10 kb in length. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising (i) the nucleotide 
sequence of SEQ ID NO: 5 or (ii) the complement of the 
nucleoide sequence of SEQ ID NO: 5, wherein the isolated 

30 nucleic acid is no more than about 100 kb in length, 
typically no more than about 75 kb in length, more 
typically no more than about 50 kb in length. SEQ ID 
NO: 5 presents the 5' untranslated region of the PAPP-Ea 
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cDNA, which is not found in the PAPP-Ef cDNA. Often, 
the isolated nucleic acids of this ^embodiment are no 
more than about 2 5 kb in length, often no more than 
about 15 kb in length, and frequently no more than 
5 about 10 kb in length. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising (i) the nucleotide 
sequence of SEQ ID NO: 6, (ii) a degenerate variant of 
the nucleotide sequence of SEQ ID NO: 6, or (iii) the 

10 complement (i) or (ii) , wherein the isolated nucleic 

acid is no more than about 100 kb in length, typically 
no more than about 75 kb in length, more typically no 
more than about 50 kb in length, SEQ ID NO: 6 presents 
the nucleotide sequence of the 5' portion of the coding 

15 region of the PAPP-Ea cDNA not found in the PAPP-Ef 
cDNA. Often, the isolated nucleic acids of this 
embodiment are no more than about 25 kb in length, 
often no more than about 15 kb in length, and 
frequently no more than about 10 kb in length. 

20 In yet another embodiment, the invention 

provides isolated nucleic acids comprising (i) a 
nucleotide sequence that encodes a polypeptide with the 
amino acid sequence of SEQ ID NO: 7 or (ii) the 
complement of a nucleotide sequence that encodes a 

25 polypeptide with the amino acid sequence of SEQ ID 

NO: 7, wherein the isolated nucleic acid is no more than 
about 100 kb in length, typically no more than about 75 
kb in length, more typically no more than about 50 kb 
in length. SEQ ID NO: 7 is the amino acid sequence of 

30 the N-terminal coding region of the PAPP-Ea isoform 
absent from the PAPP-Ef cDNA. Often, the isolated 
nucleic acids of this embodiment are no more than about 
2 5 kb in length, often no more than about 15 kb in 



- 61 - 



length, and frequently no more than about 10 kb in 
length . 

In yet a further embodiment/ the invention 
provides an isolated nucleic acid comprising a 
5 nucleotide sequence (i) that encodes a polypeptide 

having the sequence of SEQ. ID NO: 7, (ii) that encodes a 
polypeptide having the sequence of SEQ ID NO: 7 with 
conservative amino acid substitutions, or (iii) the 
complement of (i) or (ii) , wherein the isolated nucleic 

10 acid is no more than about 100 kb in length, typically 
no more than about 75 kb in length, more typically no 
more than about 50 kb in length* Often, the isolated 
nucleic acids of this embodiment are no more than about 
25 kb in length, often no more than about 15 kb in 

15 length, and frequently no more than about 10 kb in 
length. 

In a further embodiment, the invention 
provides an isolated nucleic acid comprising (i) the 
nucleotide sequence of SEQ ID NO: 11, (ii) a degenerate 

20 variant of SEQ ID NO: 11, or (iii) the complement of (i) 
or (ii) , wherein the isolated nucleic acid is no more 
than about 100 kb in length, typically no more than 
about 75 kb in length, more typically no more than 
about 50 kb length. Often, the isolated nucleic acids 

25 of this embodiment are no more than about 25 kb in 

length, often no more than about 15 kb in length, and 
frequently no more than about 10 kb in length. SEQ ID 
NO: 11 provides the portion of the PAPP-E cDNA sequence 
drawn from exon 21, which appears uniquely in the PAPP- 

30 Eb isoform; probes that include SEQ ID NO: 11 and no 
other portions of the PAPP-E gene will be useful in 
discriminating expression of the PAPP-Eb isoform. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising (i) a nucleotide 
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sequence that encodes SEQ ID NO: 12 or (ii) the 
complement of a nucleotide sequence that encodes SEQ ID 
NO: 12, wherein the isolated nucleic acid is no more 
than about 100 kb in length, typically no more than 
5 about 75 kb in length, frequently no more than about 50 
kb in length. SEQ ID NO: 12 provides the amino acid 
sequence encoded by exon 21 that is uniquely present in 
the PAPP-Eb isoform (aa 1735 - 1762) . Often, the 
isolated nucleic acids of this embodiment are no more 
10 than about 25 kb in length, often no more than about 15 
kb in length, and frequently no more than about 10 kb 
in length. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising (i) a nucleotide 

15 sequence that encodes SEQ ID NO: 12, (ii) a nucleotide 
sequence that encodes SEQ ID NO: 12 with conservative 
substititions, or (iii) the complement of (i) or (ii) , 
wherein the isolated nucleic acid is no more than about 
100 kb in length, typically no more than about 75 kb in 

20 length, and often no more than about 50 kb in length. 
Often, the isolated nucleic acids of this embodiment 
are no more than about 25 kb in length, often no more 
than about 15 kb in length, and frequently no more than 
about 10 kb in length. 

25 In a further embodiment, the invention 

provides an isolated nucleic acid comprising (i) a 
nucleotide sequence that encodes a polypeptide with the 
amino acid sequence of SEQ ID NO: 14 or (ii) the 
complement of a nucleotide sequence that encodes a 

30 polypeptide with the amino acid sequence of SEQ ID 
NO: 14, wherein the isolated nucleic acid is no more 
than about 100 kb in length, typically no more than 
about 75 kb in length, often no more than about 50 kb 
in length. SEQ ID NO: 14 is the amino acid sequence 
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unique to the PAPP-Eb isoform, both that encoded by 
exon 21 and that caused by subsequent frameshit (aa 
1735 - 1770) . Often, the isolated nucleic acids of 
this embodiment are no more than about 2 5 kb in length, 
5 often no more than about 15 kb in length, and 
frequently no more than about 10 kb in length. 

In yet another embodiment, the invention 
provides isolated nucleic acids comprising a nucleotide 
sequence (i) that encodes a polypeptide having the 

10 sequence of SEQ ID NO: 14, (ii) that encodes a 

polypeptide having the sequence of SEQ ID NO: 14 with 
conservative amino acid substitutions, or (iii) the 
complement of (i) or (ii), wherein the isolated nucleic 
acid is no more than about 100 kb in length, typically 

15 no more than about 75 kb in length, often no more than 
about 50 kb in length. Often, the isolated nucleic 
acids of this embodiment are no more than about 25 kb 
in length, often no more than about 15 kb in length, 
and frequently no more than about 10 kb in length. 

20 In another embodiment, the invention provides 

an isolated nucleic acid comprising (i) the nucleotide 
sequence of SEQ ID NO: 17, (ii) a degenerate variant of 
SEQ ID NO:17, or (iii) the complement of (i) or (ii) . 
SEQ ID NO: 17 provides the nucleotide sequence 

25 surrounding the junction of exons 1 and 4, a junction 
unique to PAPP-Ec among the PAPP-E isoforms. 

In another embodiment, the invention provides 
isolated nucleic acids comprising (i) a nucleotide 
sequence that encodes SEQ ID NO: 18, (ii) a nucleotide 

30 sequence that encodes SEQ ID NO: 18 with conservative 
amino acid substititions, or (iii) the complement of 
(i) or (ii) . SEQ ID NO: 18 presents the 20 amino acid 
sequence centered at the junction between exons 1 and 
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4, a sequence unique to PAPP-Ec among the PAPP-E 
isoforms . 

Cross-Hvbridizina Nucleic Acids 

In another series of nucleic acid 
5 embodiments , the invention provides isolated nucleic 
acids that hybridize to various of the PAPP-E nucleic 
acids of the present invention. These cross- 
hybridizing nucleic acids can be used, inter alia, as 
probes for, and to drive expression of, proteins that 

10 are related to the PAPP-E isoforms of the present 

invention as further isoforms, homologues, paralogues, 
or orthologues. 

In a first such embodiment, the invention 
provides an isolated nucleic acid comprising a sequence 

15 that hybridizes under high stringency conditions to a 
probe the nucleotide sequence of which consists of SEQ 
ID NO: 4 or the complement of SEQ ID NO: 4, wherein the 
isolated nucleic acid is no more than about 100 kb in 
length, typically no more than about 75 kb in length, 

20 and often no more than about 50 kb in length. Often, 
the isolated nucleic acids of this embodiment are no 
more than about 25 kb in length, often no more than 
about 15 kb in length, and frequently no more than 
about 10 kb in length. 

25 In a further embodiment, the invention 

provides an isolated nucleic acid comprising a sequence 
that hybridizes under moderate stringency conditions to 
a probe the nucleotide sequence of which consists of 
SEQ ID NO: 4 or the complement of SEQ ID NO: 4, wherein 

30 the isolated nucleic acid is no more than about 100 kb 
in length, typically no more than about 7 5 kb in 
length, and often no more than about 50 kb in length. 
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Often, the isolated nucleic acids of this embodiment 
are no more than about 25 kb in length, often no more 
than about 15 kb in length, and frequently no more than 
about 10 kb in length. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising a sequence that 
hybridizes under high stringency conditions to a 
hybridization probe that consists of a nucleotide 
sequence that encodes SEQ ID NO: 5, wherein the isolated 
nucleic acid is no more than about 100 kb in length, 
typically no more than about 75 kb in length, and often 
no more than about 50 kb in length. Often, the 
isolated nucleic acids of this embodiment are no more 
than about 25 kb in length, often no more than about 15 
kb in length, and frequently no more than about 10 kb 
in length. 

In yet another embodiment, the invention 
provides an isolated nucleic acid comprising a sequence 
that hybridizes under moderate stringency conditions to 
a hybridization probe consisting of a nucleotide 
sequence that encodes SEQ ID NO: 5, wherein the isolated 
nucleic acid is no more than about 100 kb in length, 
typically no more than about 75 kb in length, and often 
no more than about 50 kb in length. Often, the 
isolated nucleic acids of this embodiment are no more 
than about 25 kb in length, often no more than about 15 
kb in length, and frequently no more than about 10 kb 
in length. 

In an additional embodiment, the invention 
provides an isolated nucleic acid comprising a sequence 
that hybridizes under high stringency conditions to a 
hybridization probe the nucleotide sequence of which 
consists of SEQ ID NO: 6 or the complement of SEQ ID 
NO: 6, wherein the isolated nucleic acid is no more than 



- 66 - 



about 100 kb in length, typically no more than about 75 
kb in length, and often no more than about 50 kb in 
length* Often, the isolated nucleic acids of this 
embodiment are no more than about 25 kb in length, 
5 often no more than about 15 kb in length, and 
frequently no more than about 10 kb in length. 

The invention further provides an isolated 
nucleic acid comprising a sequence that hybridizes 
under moderate stringency conditions to a hybridization 

10 probe the nucleotide sequence of which consists of SEQ 
ID NO: 6 or the complement of SEQ ID NO: 6, wherein the 
isolated nucleic acid is no more than about 100 kb in 
length, typically no more than about 75 kb in length, 
and often no more than about 50 kb in length. Often, 

15 the isolated nucleic acids of this embodiment are no 
more than about 25 kb in length, often no more than 
about 15 kb in length, and frequently no more than 
about 10 kb in length. 

The invention also provides an isolated 

20 nucleic acid comprising a sequence that hybridizes 
under high stringency conditions to a hybridization 
probe the nucleotide sequence of which (i) encodes a 
polypeptide having the sequence of SEQ ID NO: 7, (ii) 
encodes a polypeptide having the sequence of SEQ ID 

25 NO: 7 with conservative amino acid substitutions, or 
(iii) is the complement of (i) or (ii) , wherein the 
isolated nucleic acid is no more than about 100 kb in 
length, typically no more than about 7 5 kb in length, 
and often no more than about 50 kb in length. Often, 

30 the isolated nucleic acids of this embodiment are no 
more than about 25 kb in length, often no more than 
about 15 kb in length, and frequently no more than 
about 10 kb in length. 
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Additionally, the invention provides an 
isolated nucleic acid comprising a sequence that 
hybridizes under moderate stringency conditions to a 
hybridization probe the nucleotide sequence of which 
5 (i) encodes a polypeptide having the sequence of SEQ ID 
NO: 7, (ii) encodes a polypeptide having the sequence of 
SEQ ID NO: 7 with conservative amino acid substitutions, 
or (iii) is the complement of (i) or (ii), wherein the 
isolated nucleic acid is no more than about 100 kb in 

10 length, typically no more than about 75 kb in length, 
and often no more than about 50 kb in length. Often, 
the isolated nucleic acids of this embodiment are no 
more than about 25 kb in length, often no more than 
about 15 kb in length, and frequently no more than 

15 about 10 kb in length. 

In a further embodiment, the invention 
provides an isolated nucleic acid comprising a sequence 
that hybridizes under high stringency conditions to a 
probe the nucleotide sequence of which consists of SEQ 

20 ID NO: 11 or the complement of SEQ ID NO: 11, wherein the 
isolated nucleic acid is no more than about 100 kb in 
length, typically no more than about 75 kb in length, 
and often no more than about 50 kb in length. Often, 
the isolated nucleic acids of this embodiment are no 

25 more than about 25 kb in length, often no more than 
about 15 kb in length, and frequently no more than 
about 10 kb in length. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising a sequence that 

30 hybridizes under moderate stringency conditions to a 

probe the nucleotide sequence of which consists of SEQ 
ID NO: 11 or the complement of SEQ ID NO: 11, wherein the 
isolated nucleic acid is no more than about 100 kb in 
length, typically no more than about 75 kb in length, 
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and often no more than about 50 kb in length. Often, 
the isolated nucleic acids of this embodiment are no 
more than about 25 kb in length, often no more than 
about 15 kb in length, and frequently no more than 
5 about 10 kb in length. 

In a further embodiment, the invention 
provides an isolated nucleic acid comprising a sequence 
that hybridizes under high stringency conditions to a 
hybridization probe the nucleotide sequence of which 

10 (i) encodes a polypeptide having the sequence of SEQ ID 
NO: 12, (ii) encodes a polypeptide having the sequence 
of SEQ ID NO: 12 with conservative amino acid 
substitutions, or (iii) is the complement of (i) or 
(ii), wherein the isolated nucleic acid is no more than 

15 about 100 kb in length, typically no more than about 75 
kb in length, and often no more than about 50 kb in 
length. Often, the isolated nucleic acids of this 
embodiment are no more than about 25 kb in length, 
often no more than about 15 kb in length, and 

2 0 frequently no more than about 10 kb in length. 

In a yet further embodiment, the invention 
provides an isolated nucleic acid comprising a sequence 
that hybridizes under moderate stringency conditions to 
a hybridization probe the nucleotide sequence of which 

25 (i) encodes a polypeptide having the sequence of SEQ ID 
NO: 12, (ii) encodes a polypeptide having the sequence 
of SEQ ID NO: 12 with conservative amino acid 
substitutions, or (iii) is the complement of (i) or 
(ii), wherein the isolated nucleic acid is no more than 

30 about 100 kb in length, typically no more than about 75 
kb in length, and often no more than about 50 kb in 
length. Often, the isolated nucleic acids of this 
embodiment are no more than about 2 5 kb in length, 
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often no more than about 15 kb in length, and 
frequently no more than about 10 kb in length. 

Preferred Nucleic Acids 

Particularly preferred among the above- 
5 described nucleic acids are those that are expressed, 
or the complement of which are expressed, in placental 
tissue, preferably at a level greater than that in HeLa 
cells, typically at a level at least two-fold that in 
HeLa cells, often at least three-fold, four-fold, or 
10 even five-fold that in HeLa cells. 

Also particularly preferred among the above- 
described nucleic acids are those that encode, or the 
complement of which encode, a polypeptide having 
metalloproteinase activity, particularly those having 
15 cleavage specificity for an IGF binding protein. 

Other preferred embodiments of the nucleic 
acids above-described are those that encode, or the 
complement of which encode, a polypeptide having any or 
all of (i) at least one zinc binding domain, (ii) at 
20 least one notch domain, and (iii) tandemly repeated SCR 
domains . 

Nucleic Acid Fragments 

In another series of nucleic acid 
embodiments, the invention provides fragments of 
25 various of the isolated nucleic acids of the present 
invention which prove useful, inter alia, as nucleic 
acid probes, as amplification primers, and to direct 
expression or synthesis of epitopic or immunogenic 
protein fragments. 
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In a first such embodiment/ the invention 
provides an isolated nucleic acid comprising at least 
17 nucleotides, 18 nucleotides, 20 nucleotides, 24 
nucleotides, or 25 nucleotides of (i) SEQ ID NO: 4 or 
5 (ii) the complement of SEQ ID NO: 4, wherein the 

isolated nucleic acid is no more than about 100 kb in 
length, typically no more than about 75 kb in length, 
more typically no more than about 50 kb in length. SEQ 
ID NO: 4 is the nucleotide sequence of the 5 1 region of 

10 the PAPP-Ea cDNA absent from the PAPP-Ef cDNA; 

accordingly, these fragments can be used to identify 
PAPP-E isoforms other than PAPP-Ef. Often, the 
isolated nucleic acids of this embodiment are no more 
than about 25 kb in length, often no more than about 15 

15 kb in length, and frequently no more than about 10 kb 
in length. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising at least 17 
nucleotides, 18 nucleotides, 20 nucleotides, 24 

20 nucleotides, or 25 nucleotides of (i) SEQ ID NO: 5 or 
(ii) the complement of SEQ ID NO: 5, wherein the 
isolated nucleic acid is no more than about 100 kb in 
length, typically no more than about 75 kb in length, 
more typically no more than about 50 kb in length. SEQ 

25 ID NO: 5 is the nucleotide sequence of the 5' UT of the 
PAPP-Ea cDNA, which is absent from the PAPP-Ef cDNA; 
accordingly, these fragments can be used to identify 
PAPP-E isoforms other than PAPP-Ef. Often, the 
isolated nucleic acids of this embodiment are no more 

30 than about 2 5 kb in length, often no more than about 15 
kb in length, and frequently no more than about 10 kb 
in length. 

In a yet further embodiment, the invention 
provides an isolated nucleic acid comprising at least 
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at least 17 nucleotides, 18 nucleotides, 20 
nucleotides, 24 nucleotides, or 25 nucleotides of (i) 
SEQ ID NO: 6, (ii) a degenerate variant of SEQ ID NO: 6, 
or (ii) the complement of (i) or (ii) , wherein the 
5 isolated nucleic acid is no more than about 100 kb in 
length, typically no more than about 75 kb in length, 
more typically no more than about 50 kb in length. SEQ 
ID NO: 6 is the nucleotide sequence encoding the N- 
terminal amino acids absent from the PAPP-Ef isoform; 

10 accordingly, these fragments can be used to identify 
PAPP-E isoforms other than PAPP-Ef, Often, the 
isolated nucleic acids of this embodiment are no more 
than about 25 kb in length, often no more than about 15 
kb in length, and frequently no more than about 10 kb 

15 in length. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising a nucleotide 
sequence that (i) encodes a polypeptide having the 
sequence of at least 8 contiguous amino acids of SEQ ID 

20 NO: 7, (ii) encodes a polypeptide having the sequence of 
at least 8 contiguous amino acids of SEQ ID NO: 7 with 
conservative amino acid substitutions, or (iii) is the 
complement of (i) or (ii) , wherein the isolated nucleic 
acid is no more than about 100 kb in length, typically 

25 no more than about 75 kb in length, more typically no 
more than about 50 kb in length. SEQ ID NO: 7 is the 
amino acid sequence of the 19 N-terminal amino acids 
absent from the PAPP-Ef isoform. Often, the isolated 
nucleic acids of this embodiment are no more than about 

30 25 kb in length, often no more than about 15 kb in 
length, and frequently no more than about 10 kb in 
length. 

In a further embodiment, the invention 
provides an isolated nucleic acid comprising at least 
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17 nucleotides, 18 nucleotides, 20 nucleotides, 24 
nucleotides, or 25 nucleotides of (i) SEQ ID NO: 11, 
(ii) a degenerate variant of SEQ ID NO: 11, or (iii) the 
complement of (i) or (ii) , wherein the isolated nucleic 
5 acid is no more than about 100 kb in length, typically 
no more than about 75 kb in length, more typically no 
more than about 50 kb in length. Often, the isolated 
nucleic acids of this embodiment are no more than about 
25 kb in length, often no more than about 15 kb in 
10 length, and frequently no more than about 10 kb in 
length . 

The invention also provides an isolated 
nucleic acid comprising (i) a nucleotide sequence that 
encodes a peptide of at least 8 contiguous amino acids 

15 of SEQ ID NO: 12, or (ii) the complement of a nucleotide 
sequence that encodes a peptide of at least 8 
contiguous amino acids of SEQ ID NO: 12, wherein the 
isolated nucleic acid is no more than about 100 kb in 
length, typically no more than about 75 kb in length, 

20 more typically no more than about 50 kb in length. 

Often, the isolated nucleic acids of this embodiment 
are no more than about 25 kb in length, often no more 
than about 15 kb in length, and frequently no more than 
about 10 kb in length. 

25 The invention also provides an isolated 

nucleic acid comprising a nucleotide sequence that (i) 
encodes a polypeptide having the sequence of at least 8 
contiguous amino acids of SEQ ID NO: 12, (ii) encodes a 
polypeptide having the sequence of at least 8 

30 contiguous amino acids of SEQ ID NO: 12 with 

conservative amino acid substitutions, or (iii) is the 
complement of (i) or (ii) . 

The structural chemical formulas of 
representative 17-mer nucleic acid fragments of the 
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present invention as above-described are provided as 
SEQ ID NOs: [x - x'] and [z - z'] in the attached 
Sequence Listing, incorporated herein by reference in 
its entirety. The structural chemical formulas of 
5 representative 25-mer nucleic acid fragments of the 
present invention as above-described are reduced to 
drawings as SEQ ID Nos : [y - y 1 ] and [* - * f ] in the 
attached Sequence Listing, incorporated herein by 
reference in its entirety. 

10 Single Exon Probes 

The invention further provides genome-derived 
single exon probes having portions of no more than one 
exon of the PAPP-E gene. As further described in 
commonly owned and copending U.S. patent application 

15 serial no. 09/632,366 ("Methods and Apparatus for High 
Throughput Detection and Characterization of 
alternatively Spliced Genes"), the disclosure of which 
is incorporated herein by reference in its entirety, 
such single exon probes have particular utility in 

20 identifying and characterizing splice variants. In 
particular, such single exon probes are useful for 
identifying and discriminating the expression of PAPP- 
Ea, PAPP-Eb, and PAPP-Ec isoforms. 

In a first embodiment, the invention provides 

25 an isolated nucleic acid comprising a nucleotide 

sequence of no more than one portion of SEQ ID NOs: 19 
to 41 or the complement of SEQ ID NOs: 19 to 41, 
wherein the portion comprises at least 17 contiguous 
nucleotides, 18 contiguous nucleotides, 20 contiguous 

30 nucleotides, 24 contiguous nucleotides, 25 contiguous 

nucleotides, or 50 contiguous nucleotides of any one of 
SEQ ID NOs: ,19 to 41, or their complement, and 
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hybridizes under high stringency conditions to a 
nucleic acid expressed in human placenta. In a further 
embodiment, the exonic portion comprises the entirety 
of the referenced SEQ ID NO: or its complement. 
5 In other embodiments, the invention provides 

isolated single exon probes having the nucleotide 
sequence of any one of SEQ ID NOs : 42-65. 

In a particular embodiment, the invention 
provides a single exon probe having a portion of SEQ ID 
10 NO: 39. SEQ ID NO: 39 presents the exon (exon 21) that 
is unique to the PAPP-Eb isoform; single exon probes, 
including genome-derived single exon probes, having a 
portion drawn from exon 21 can be used to identify and 
or measure expression of PAPP-Eb. 

15 

Transcription Control Nucleic Acids 

In another aspect, the present invention 
provides genome-derived isolated nucleic acids that 
include nucleic acid sequence elements that control 

20 transcription of the PAPP-E gene and its various 

isoforms. These nucleic acids can be used, inter alia, 
to drive expression of heterologous coding regions in 
recombinant constructs, thus conferring upon such 
hetereologous coding regions the expression pattern of 

25 the native PAPP-E gene. These nucleic acids can also 
be used, conversely, to target heterologous 
transcription control elements to the PAPP-E genomic 
locus, altering the expression pattern of the PAPP-E 
gene itself. 

30 In a first such embodiment, the invention 

provides an isolated nucleic acid comprising the 
nucleotide sequence of SEQ ID NO: 65 or its complement, 
wherein the isolated nucleic acid is no more than about 
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100 kb in length, typically no more than about 75 kb in 
length, more typically no more than about 50 kb in 
length. Often, the isolated nucleic acids of this 
embodiment are no more than about 2 5 kb in length, 
5 often no more than about 15 kb in length, and 
frequently no more than about 10 kb in length. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising at least 17, 18, 
20, 24, or 25 nucleotides of the sequence of SEQ ID 
10 NO: 65 or its complement, wherein wherein the isolated 
nucleic acid is no more than about 

100 kb in length, typically no more than about 75 kb in 
length, more typically no more than about 50 kb in 
length. Often, the isolated nucleic acids of this 
15 embodiment are no more than about 25 kb in length, 
often no more than about 15 kb in length, and 
frequently no more than about 10 kb in length. 



VECTORS AND HOST CELLS 

In another aspect, the present invention 
20 provides vectors that comprise one or more of the 

isolated nucleic acids of the present invention, and 
host cells in which such vectors have been introduced. 

The vectors can be used, inter alia, for 
propagating the nucleic acids of the present invention 
25 in host cells (cloning vectors) , for shuttling the 
nucleic acids of the present invention between host 
cells derived from disparate organisms (shuttle 
vectors) , for inserting the nucleic acids of the 
present invention into host cell chromosomes (insertion 
30 vectors) , for expressing sense or antisense RNA 
transcripts of the nucleic acids of the present 
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invention in vitro or within a host cell, and for 
expressing polypeptides encoded by the nucleic acids of 
the present invention, alone or as fusions to 
heterologous polypeptides. Vectors of the present 
5 invention will often be suitable for several such uses. 

Vectors are by now well-known in the art, and 
are described, inter alia, in Jones et al. (eds.), 
Vectors: Cloning Applications : Essential Techniques 
(Essential Techniques Series), John Wiley & Son Ltd 

10 1998 (ISBN: 047196266X) ; Jones et al. (eds.), Vectors : 
Expression Systems : Essential Techniques (Essential 
Techniques Series), John Wiley & Son Ltd, 1998 
(ISBN: 0471962678) ; Gacesa et al., Vectors : Essential 
Data , John Wiley & Sons, 1995 (ISBN: 0471948411); 

15 Cid-Arregui (eds.), Viral Vectors: Basic Science and 
Gene Therapy , Eaton Publishing Co., 2000 (ISBN: 
188129935X) ; Sambrook et al., Molecular Cloning: A 
Laboratory Manual (3 rd ed.), Cold Spring Harbor 
Laboratory Press, 2001 (ISBN: 0879695773); Ausubel et 

20 al. (eds.), Short Protocols in Molecular Biology: A 
Compendium of Methods from Current Protocols in 
Molecular Biology (4 th ed.), John Wiley & Sons, 1999 
(ISBN: 047132938X), the disclosures of which are 
incorporated herein by reference in their entireties. 

25 Furthermore, an enormous variety of vectors are 

available commercially. Use of existing vectors and 
modifications thereof being well within the skill in 
the art, only basic features need be described here. 

Typically, vectors are derived from virus, 

30 plasmid, prokaryotic or eukaryotic chromosomal 

elements, or some combination thereof, and include at 
least one origin of replication, at least one site for 
insertion of heterologous nucleic acid, typically in 
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the form of a polylinker with multiple, tightly 
clustered, single cutting restriction sites, and at 
least one selectable marker, although some integrative 
vectors will lack an origin that is functional in the 
5 host to be chromosomally modified, and some vectors 
will lack selectable markers. Vectors of the present 
invention will further include at least one nucleic 
acid of the present invention inserted into the vector 
in at least one location. 

10 Where present, the origin of replication and 

selectable markers are chosen based upon the desired 
host cell or host cells; the host cells, in turn, are 
selected based upon the desired application. 

For example, prokaryotic cells, typically 

15 E. coli, are typically chosen for cloning. In such 
case, vector replication is predicated on the 
replication strategies of colif orm-inf ecting phage — 
such as phage lambda, M13, T7, T3 and PI — or on the 
replication origin of autonomously replicating 

20 episomes, notably the ColEl plasmid and later 

derivatives, including pBR322 and the pUC series 
plasmids. Where E. coli is used as host, selectable 
markers are, analogously, chosen for selectivity in 
gram negative bacteria: e.g., typical markers confer 

25 resistance to antibiotics, such as ampicillin, 

tetracycline, chlorampenicol, kanamycin, streptomycin, 
zeocin; auxotrophic markers can also be used. 

As another example, yeast cells, typically 5. 
cerevisiae, are chosen, inter alia, for eukaryotic 

30 genetic studies, due to the ease of targeting genetic 
changes by homologous recombination and to the ready 
ability to complement genetic defects using 
recombinantly expressed proteins, for identification of 
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interacting protein components, e.g. through use of a 
two-hybrid system, and for protein expression. Vectors 
of the present invention for use in yeast will 
typically, but not invariably, contain an origin of 
5 replication suitable for use in yeast and a selectable 
marker that is functional in yeast. 

Integrative Yip vectors do not replicate 
autonomously, but integrate, typically in single copy, 
into the yeast genome at low frequencies and thus 

10 replicate as part of the host cell chromosome; these 
vectors lack an origin of replication that is 
functional in yeast, although they typically have at 
least one origin of replication suitable for progation 
of the vector in bacterial cells. YEp vectors, in 

15 contrast, replicate episomally and autonomously due to 
presence of the yeast 2 micron plasmid origin (2 urn 
ori) . The YCp yeast centromere plasmid vectors are 
autonomously replicating vectors containing centromere 
sequences, CEN, and autonomously replicating sequences, 

2 0 ARS; the ARS sequences are believed to correspond to 
the natural replication origins of yeast chromosomes. 
YACs are based on yeast linear plasmids, denoted YLp, 
containing homologous or heterologous DNA sequences 
that function as telomeres (TEL) in vivo, as well as 

25 containing yeast ARS (origins of replication) and CEN 
(centromeres) segments . 

Selectable markers in yeast vectors include a 
variety of auxotrophic markers, the most common of 
which are (in Saccharomyces cerevisiae) URA3, HIS3, 

30 LEU2, TRP1 and LYS2, which complement specific 

auxotrophic mutations, such as ura3-52, his3-Dl, 
Ieu2-Dl, trpl-Dl and lys2-201. The URA3 and LYS2 yeast 
genes further permit negative selection based on 
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specific inhibitors, 5-f luoro-orotic acid (FOA) and 
a-aminoadipic acid (aAA) , respectively, that prevent 
growth of the prototrophic strains but allows growth of 
the ura3 and lys2 mutants, respectively. Other 
5 selectable markers confer resistance to, e.g., zeocin. 

As yet another example, insect cells are 
often chosen for high efficiency protein expression. 
Where the host cells are from Spodoptera frugiperda — 
e.g., Sf9 and Sf21 cell lines, and expresSF™ cells 

10 (Protein Sciences Corp., Meriden, CT, USA) — the vector 
replicative strategy is typically based upon the 
baculovirus life cycle. Typically, baculovirus 
transfer vectors are used to replace the wild-type 
AcMNPV polyhedrin gene with a heterologous gene of 

15 interest. Sequences that flank the polyhedrin gene in 
the wild-type genome are positioned 5' and 3 f of the 
expression cassette on the transfer vectors. Following 
cotrans feet ion with AcMNPV DNA, a homologous 
recombination event occurs between these sequences 

20 resulting in a recombinant virus carrying the gene of 

interest and the polyhedrin or plO promoter. Selection 
can be based upon visual screening for lacZ fusion 
activity. 

As yet another example, mammalian cells are 
25 often chosen for expression of proteins intended as 
pharmaceutical agents, and are also chosen as host 
cells for screening of potential agonist and 
antagonists of a protein or a physiological pathway. 

Where mammalian cells are chosen as host 
30 cells, vectors intended for autonomous extrachromosomal 
replication will typically include a viral origin, such 
as the SV40 origin (for replication in cell lines 
expressing the large T-antigen, such as COS1 and COS7 
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cells), the papillomavirus origin, or the EBV origin 
for long term episomal replication (for use, e.g., in 
293-EBNA cells, which constitutively express the EBV 
EBNA-1 gene product and adenovirus E1A) . Vectors 
5 intended for integration, and thus replication as part 
of the mammalian chromosome, can, but need not, include 
an origin of replication functional in mammalian cells, 
such as the SV40 origin. Vectors based upon viruses, 
such as adenovirus, adeno-associated virus, vaccinia 
10 virus, and various mammalian retroviruses, will 

typically replicate according to the viral replicative 
strategy. 

Selectable markers for use in mammalian cells 
include resistance to neomycin (G418) , blasticidin, 

15 hygromycin and to zeocin, and selection based upon the 
purine salvage pathway using HAT medium. 

Vectors of the present invention will also 
often include elements that permit in vitro 
transcription of RNA from the inserted heterologous 

20 nucleic acid. Such vectors typically include a phage 
promoter, such as that from T7, T3, or SP6, flanking 
the nucleic acid insert. Often two different such 
promoters flank the inserted nucleic acid, permitting 
separate in vitro production of both sense and 

25 antisense strands. 

Expression vectors of the present 
invention — that is, those vectors that will drive 
expression of polypeptides from the inserted 
heterologous nucleic acid — will often include a 

30 variety of other genetic elements operatively linked to 
the protein-encoding heterologous nucleic acid insert, 
typically genetic elements that drive transcription, 
such as promoters and enhancer elements, those that 



- 81 - 



facilitate RNA processing, such as transcription 
termination and/or polyadenylation signals, and those 
that facilitate translation, such as ribosomal 
consensus sequences . 
5 For example, vectors for expressing proteins 

of the present invention in prokaryotic cells, 
typically E. coli, will include a promoter, often a 
phage promoter, such as phage lambda pL promoter, the 
trc promoter, a hybrid derived from the trp and lac 

10 promoters, the bacteriophage T7 promoter (in E. coli 

cells engineered to express the T7 polymerase), or the 
araBAD operon. Often, such prokaryotic expression 
vectors will further include transcription terminators, 
such as the aspA terminator, and elements that 

15 facilitate translation, such as a consensus ribosome 

binding site and translation termination codon, Schomer 
et al., Proc. Natl. Acad. Sci. USA 83:8506-8510 (1986). 

As another example, vectors for expressing 
proteins of the present invention in yeast cells, 

20 typically S. cerevisiae, will include a yeast promoter, 
such as the CYC1 promoter, the GAL1 promoter, ADH1 
promoter, or the GPD promoter, and will typically have 
elements that facilitate transcription termination, 
such as the transcription termination signals from the 

25 CYC1 or ADH1 gene. 

As another example, vectors for expressing 
proteins of the present invention in mammalian cells 
will include a promoter active in mammalian cells. 
Such promoters are often drawn from mammalian 

30 viruses — such as the enhancer-promoter sequences from 
the immediate early gene of the human cytomegalovirus 
(CMV) , the enhancer-promoter sequences from the Rous 
sarcoma virus long terminal repeat (RSV LTR) , and the 
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enhancer-promoter from SV40. Often, expression is 
enhanced by incorporation of polyadenylation sites, 
such as the late SV40 polyadenylation site and the 
polyadenylation signal and transcription termination 
5 sequences from the bovine growth hormone (BGH) gene, 
and ribosome binding sites. Furthermore, vectors can 
include introns, such as intron II of rabbit p-globin 
gene and the SV40 splice elements. 

Vector-drive protein expression can be 

10 constitutive or inducible. 

Inducible vectors include either naturally 
inducible promoters, such as the trc promoter, which is 
regulated by the lac operon, and the pL promoter, which 
is regulated by tryptophan, the MMTV-LTR promoter, 

15 which is inducible by dexamethasone, or can contain 
synthetic promoters and/or additional elements that 
confer inducible control on adjacent promoters. 
Examples of inducible synthetic promoters are the 
hybrid Plac/ara-1 promoter and the PLtetO-1 promoter. 

20 The PltetO-1 promoter takes advantage of the 

high expression levels from the PL promoter of phage 
lambda, but replaces the lambda repressor sites with 
two copies of operator 2 of the TnlO tetracycline 
resistance operon, causing this promoter to be tightly 

25 repressed by the Tet repressor protein and induced in 
response to tetracycline (Tc) and Tc derivatives such 
as anhydrotetracycline . 

As another example of inducible elements, 
hormone response elements, such as the glucocorticoid 

30 response element (GRE) and the estrogen response 

element (ERE) , can confer hormone inducibility where 
vectors are used for expression in cells having the 
respective hormone receptors . To reduce background 
levels of expression, elements responsive to ecdysone, 
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an insect hormone, can be used instead, with 
coexpression of the ecdysone receptor. 

Expression vectors can be designed to fuse 
the expressed polypeptide to small protein tags that 
5 facilitate purification and/or visualization. 

For example, proteins can be expressed with a 
polyhistidine tag that facilitates purification of the 
fusion protein by immobilized metal affinity 
chromatography, for example using NiNTA resin (Qiagen 

10 Inc., Valencia, CA, USA) or TALON™ resin (cobalt 

immobilized affinity chromatography medium, Clontech 
Labs, Palo Alto, CA, USA) . As another example, the 
fusion protein can include a chitin-binding tag and 
self-excising intein, permitting chitin-based 

15 purification with self-removal of the fused tag 

( IMPACT™ system, New England Biolabs, Inc., Beverley, 
MA, USA) . Alternatively, the fusion protein can 
include a calmodulin-binding peptide tag, permitting 
purification by calmodulin affinity resin (Stratagene, 

20 La Jolla, CA, USA) , or a specifically excisable 

fragment of the biotin carboxylase carrier protein, 
permitting purification of in vivo biotinylated protein 
using an avidin resin and subsequent tag removal 
(Promega, Madison, WI, USA) . 

25 Other tags include, for example, the Xpress 

epitope, detectable by anti-Xpress antibody 
(Invitrogen, Carlsbad, CA, USA), a myc tag, detectable 
by anti-myc tag antibody, the V5 epitope, detectable by 
anti-V5 antibody (Invitrogen, Carlsbad, CA, USA), FLAG® 

30 epitope, detectable by anti-FLAG® antibody (Stratagene, 
La Jolla, CA, USA), and the HA epitope. 

For secretion of expressed proteins, vectors 
can include appropriate sequences that encode secretion 
signals, such as leader peptides. For example, the 
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pSecTag2 vectors ( Invitrogen, Carlsbad, CA, USA) are 
5.2 kb mammalian expression vectors that carry the 
secretion signal from the V-J2-C region of the mouse Ig 
kappa-chain for efficient secretion of recombinant 
5 proteins from a variety of mammalian cell lines. 

Expression vectors can also be designed to 
fuse proteins encoded by the heterologous nucleic acid 
insert to polypeptides larger than purification and/or 
identification tags. Useful protein fusions include 

10 those that permit display of the encoded protein on the 
surface of a phage or cell, fusions to intrinsically 
fluorescent proteins, such as green fluorescent protein 
(GFP) , fusions to the IgG Fc region, and fusions for 
use in two hybrid systems. 

15 Vectors for phage display fuse the encoded 

polypeptide to, e.g., the gene III protein (pill) or 
gene VIII protein (pVIII) for display on the surface of 
filamentous phage, such as M13. See Barbas et al . , 
Phage Display: A Laboratory Manual , Cold Spring Harbor 

20 Laboratory Press (2001) (ISBN 0-87969-546-3); Kay et 
al. (eds.), Phage Display of Peptides and Proteins: A 
Laboratory Manual , San Diego: Academic Press, Inc., 
1996; Abelson et al. (eds.), Combinatorial Chemistry , 
Methods in Enzymology vol. 2 67, Academic Press (May 

25 1996) . 

Vectors for yeast display, e.g. the pYDl 
yeast display vector (Invitrogen, Carlsbad, CA, USA), 
use the a-agglutinin yeast adhesion receptor to display 
recombinant protein on the surface of S. cerevisiae. 
30 Vectors for mammalian display, e.g., the pDisplay™ 
vector (Invitrogen, Carlsbad, CA, USA), target 
recombinant proteins using an N-terminal cell surface 
targeting signal and a C-terminal transmembrane 
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anchoring domain of platelet derived growth factor 
receptor. 

A wide variety of vectors now exist that fuse 
proteins encoded by heterologous nucleic acids to the 
5 chromophore of the substrate-independent, intrinsically 
fluorescent green fluorescent protein from Aequorea 
victoria ("GFP") and its variants. These proteins are 
intrinsically fluorescent: the GFP-like chromophore is 
entirely encoded by its amino acid sequence and can 
10 fluoresce without requirement for cofactor or 
substrate . 

Structurally, the GFP-like chromophore 
comprises an 11-stranded (3-barrel (0-can) with a 
central a-helix, the central a-helix having a 

15 conjugated n-resonance system that includes two 

aromatic ring systems and the bridge between them. The 
n-resonance system is created by autocatalytic 
cyclization among amino acids; cyclization proceeds 
through an imidazolinone intermediate, with subsequent 

20 dehydrogenation by molecular oxygen at the Ca-C(3 bond 
of a participating tyrosine. 

The GFP-like chromophore can be selected from 
GFP-like chromophores found in naturally occurring 
proteins, such as A. victoria GFP (GenBank accession 

25 number AAA27721) , Renilla reniformis GFP, FP583 
(GenBank accession no. AF168419) (DsRed) , FP593 
(AF272711), FP483 (AF168420) , FP484 (AF168424), FP595 
(AF246709), FP486 (AF168421), FP538 (AF168423) , and 
FP506 (AF168422), and need include only so much of the 

30 native protein as is needed to retain the chromophore 1 s 
intrinsic fluorescence. Methods for determining the 
minimal domain required for fluorescence are known in 
the art. Li et al ., "Deletions of the Aequorea victoria 
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Green Fluorescent Protein Define the Minimal Domain 
Required for Fluorescence," J. Biol. Chem. 272:28545- 
28549 (1997). 

Alternatively, the GFP-like chromophore can 
5 be selected from GFP-like chromophores modified from 
those found in nature. Typically, such modifications 
are made to improve recombinant production in 
heterologous expression systems (with or without change 
in protein sequence) , to alter the excitation and/or 

10 emission spectra of the native protein, to facilitate 
purification, to facilitate or as a consequence of 
cloning, or are a fortuitous consequence of research 
investigation. 

The methods for engineering such modified 

15 GFP-like chromophores and testing them for fluorescence 
activity, both alone and as part of protein fusions, 
are well-known in the art. Early results of these 
efforts are reviewed in Heim et a 1 . , Curr. Biol. 6:178- 
182 (1996), incorporated herein by reference in its 

20 entirety; a more recent review, with tabulation of 

useful mutations, is found in Palm et al . , "Spectral 
Variants of Green Fluorescent Protein, " in Green 
Fluorescent Proteins , Conn (ed.), Methods Enzymol . vol. 
302, pp. 378 - 394 (1999), incorporated herein by 

25 reference in its entirety. A variety of such modified 
chromophores are now commercially available and can 
readily be used in the fusion proteins of the present 
invention . 

For example, EGFP ("enhanced GFP"), Cormack 
30 et al., Gene 173:33-38 (1996); U.S. Pat. Nos. 6,090,919 
and 5,804,387, is a red-shifted, human codon-optimized 
variant of GFP that has been engineered for brighter 
fluorescence, higher expression in mammalian cells, and 
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for an excitation spectrum optimized for use in flow 
cytometers. EGFP can usefully contribute a GFP-like 
chromophore to the fusion proteins of the present 
invention. A variety of EGFP vectors, both plasmid and 
5 viral, are available commercially (Clontech Labs, Palo 
Alto, CA, USA) , including vectors for bacterial 
expression, vectors for N-terminal protein fusion 
expression, vectors for expression of C-terminal 
protein fusions, and for bicistronic expression. 

10 Toward the other end of the emission 

spectrum, EBFP ("enhanced blue fluorescent protein") 
and BFP2 contain four amino acid substitutions that 
shift the emission from green to blue, enhance the 
brightness of fluorescence and improve solubility of 

15 the protein, Heim et al . , Curr. Biol. 6:178-182 (1996); 
Cormack et al., Gene 173:33-38 (1996). EBFP is 
optimized for expression in mammalian cells whereas 
BFP2, which retains the original jellyfish codons, can 
be expressed in bacteria; as is further discussed 

20 below, the host cell of production does not affect the 
utility of the resulting fusion protein. The GFP-like 
chromophores from EBFP and BFP2 can usefully be 
included in the fusion proteins of the present 
invention, and vectors containing these blue-shifted 

25 variants are available from Clontech Labs (Palo Alto, 
CA, USA) . 

Analogously, EYFP ("enhanced yellow 
fluorescent protein"), also available from Clontech 
Labs, contains four amino acid substitutions, different 
30 from EBFP, Ormo et al . , Science 273:1392-1395 (1996), 
that shift the emission from green to yellowish-green. 
Citrine, an improved yellow fluorescent protein mutant, 
is described in Heikal et al., Proc. Natl. Acad. Sci. 
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USA 97:11996-12001 (2000). ECFP ("enhanced cyan 
fluorescent protein") (Clontech Labs, Palo Alto, CA, 
USA) contains six amino acid substitutions, one of 
which shifts the emission spectrum from green to cyan. 
5 Heim et al., Curr. Biol. 6:178-182 (1996); Miyawaki et 
al. f Nature 388:882-887 (1997). The GFP-like 
chromophore of each of these GFP variants can usefully 
be included in the fusion proteins of the present 
invention . 

10 The GFP-like chromophore can also be drawn 

from other modified GFPs, including those described in 
U.S. Pat. Nos. 6,124,128; 6,096,865; 6,090,919; 
6,066,476; 6,054,321; 6,027,881; 5,968,750; 5,874,304; 
5,804,387; 5,777,079; 5,741,668; and 5,625,048, the 

15 disclosures of which are incorporated herein by 

reference in their entireties. See also Conn (ed.), 
Green Fluorescent Protein , Methods in 

Fusions to the IgG Fc region increase serum 
half life of protein pharmaceutical products through 

20 interaction with the FcRn receptor (also denominated 
the FcRp receptor and the Brambell receptor, FcRb) , 
further described in international patent application 
nos. WO 97/43316, WO 97/34631, WO 96/32478, WO 
96/18412. 

25 The present invention further includes host 

cells comprising the vectors of the present invention, 
either present episomally within the cell or 
integrated, in whole or in part, into the host cell 
chromosome . 

30 As noted earlier, host cells can be 

prokaryotic or eukaryotic. Representative examples of 
appropriate host cells include, but are not limited to, 
bacterial cells, such as E. coli, Caulobacter 
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crescentus, Streptomyces species, and Salmonella 
typhimurium; yeast cells, such as Saccharomyces 
cerevisiae , Schizosaccharomyces pombe, Pichia pastoris, 
Pichia methanolica; insect cell lines, such as those 
5 from Spodoptera frugiperda — e.g., Sf9 and Sf21 cell 
lines, and expresSF™ cells (Protein Sciences Corp., 
Meriden, CT, USA) — Drosophila S2 cells, and 
Trichoplusia nl High Five® Cells (Invitrogen, Carlsbad, 
CA, USA); and mammalian cells. Typical mammalian cells 

10 include COS1 and COS7 cells, Chinese hamster ovary 
(CHO) cells, NIH 3T3 cells, 293 cells, HEPG2 cells, 
HeLa cells, L cells, murine ES cell lines (e.g., from 
strains 129/SV, C57/BL6, DBA- 1 , 129/SVJ) , K562, Jurkat 
cells, and BW5147. Other mammalian cell lines are well 

15 known and readily available from the American Type 

Culture Collection (ATCC) (Manassas, VA, USA) and the 
National Institute of General medical Sciences (NIGMS) 
Human Genetic Cell Repository at the Coriell Cell 
Repositories (Camden, NJ, USA) . 

2 0 Methods for introducing the vectors and 

nucleic acids of the present invention into the host 
cells are well known in the art; the choice of 
technique will depend primarily upon the specific 
vector to be introduced. 

25 For example, phage lambda vectors will 

typically be packaged using a packaging extract (e.g., 
Gigapack® packaging extract, Stratagene, La Jolla, CA, 
USA), and the packaged virus used to infect E. coli. 
Plasmid vectors will typically be introduced into 

30 chemically competent or electrocompetent bacterial 
cells . 

E. coli cells can be rendered chemically 
competent by treatment, e.g., with CaCl 2 , or a solution 
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of Mg 2+ , Mh 2+ , Ca 2+ , Rb + or K + , dimethyl sulfoxide, 
dithiothreitol, and hexamine cobalt (III), Hanahan, J. 
Mol. Biol. 166(4) :557-80 (1983), and vectors introduced 
by heat shock. A wide variety of chemically competent 
strains are also available commercially (e.g., 
Epicurian Coli® XLIO-Gold® Ultracompetent Cells 
(Stratagene, La Jolla, CA, USA) ; DH5a competent cells 
(Clontech Laboratories, Palo Alto, CA, USA) ; TOP10 
Chemically Competent E. coli Kit (Invitrogen, Carlsbad, 
CA, USA) ) . 

Bacterial cells can be rendered 
electrocompetent - that is, competent to take up 
exogenous DNA by electroporation — by various pre-pulse 
treatments; vectors are introduced by electroporation 
followed by subsequent outgrowth in selected media. An 
extensive series of protocols is provided online in 
Electroprotocols (BioRad, Richmond, CA, USA) 
(http://www.bio- 

rad.com/LifeScience/pdf/New_Gene_Pulser.pdf) . 

Vectors can be introduced into yeast cells by 
spheroplasting, treatment with lithium salts, 
electroporation, or protoplast fusion. 

Spheroplasts are prepared by the action of 
hydrolytic enzymes - a snail-gut extract, usually 
denoted Glusulase, or Zymolyase, an enzyme from 
Arthrobacter luteus — to remove portions of the cell 
wall in the presence of osmotic stabilizers, typically 
1 M sorbitol. DNA is added to the spheroplasts, and 
the mixture is co-precipitated with a solution of 
polyethylene glycol (PEG) and Ca 2+ . Subsequently, the 
cells are resuspended in a solution of sorbitol, mixed 
with molten agar and then layered on the surface of a 
selective plate containing sorbitol. For lithium- 
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mediated transformation, yeast cells are treated with 
lithium acetate, which apparently permeabilizes the 
cell wall, DNA is added and the cells are 
co-precipitated with PEG. The cells are exposed to a 
5 brief heat shock, washed free of PEG and lithium 

acetate, and subsequently spread on plates containing 
ordinary selective medium. Increased frequencies of 
transformation are obtained by using specially-prepared 
single-stranded carrier DNA and certain organic 

10 solvents. Schiestl et al . , Curr. Genet. 16 (5-6) : 339-46 
(1989) . For electroporation, freshly-grown yeast 
cultures are typically washed, suspended in an osmotic 
protectant, such as sorbitol, mixed with DNA, and the 
cell suspension pulsed in an electroporation device. 

15 Subsequently, the cells are spread on the surface of 
plates containing selective media. Becker et al . , 
Methods Enzymol . 194:182-7 (1991). The efficiency of 
transformation by electroporation can be increased over 
100-fold by using PEG, single-stranded carrier DNA and 

20 cells that are in late log-phase of growth. Larger 
constructs, such as YACs, can be introduced by 
protoplast fusion. 

Mammalian and insect cells can be directly 
infected by packaged viral vectors, or transfected by 

25 chemical or electrical means. 

For chemical transf ection, DNA can be 
coprecipitated with CaP0 4 or introduced using liposomal 
and nonliposomal lipid-based agents. Commercial kits 
are available for CaP0 4 transfection (CalPhos™ Mammalian 

30 Transfection Kit, Clontech Laboratories, Palo Alto, CA, 
USA), and lipid-mediated transfection can be practiced 
using commercial reagents, such as LIPOFECTAMINE™ 2000, 
LIPOFECTAMINE™ Reagent, CELLFECTIN® Reagent, and 
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LIPOFECTIN® Reagent (Invitrogen, Carlsbad, CA, USA), 
DOTAP Liposomal Transfection Reagent, FuGENE 6, 
X-tremeGENE Q2, DOSPER, (Roche Molecular Biochemicals, 
Indianapolis, IN USA) , Effectene™, PolyFect®, Superfect® 
5 (Qiagen, Inc., Valencia, CA, USA), Protocols for 

electroporating mammalian cells can be found online in 
Electroprotocols (Bio-Rad, Richmond, CA, USA) 
(http: //www.bio- 

rad- com/LifeScience/pdf /New_Gene_Pulser .pdf ) . 
10 See also, Norton et al . (eds.), Gene Transfer Methods: 
Introducing DNA into Living Cells and Organisms , 
BioTechiques Books, Eaton Publishing Co. (2000) (ISBN 
1-881299-34-1), incorporated herein by reference in its 
entirety. 

15 PROTEINS 

In another aspect, the present invention 
provides PAPP-E isoform proteins, various fragments 
thereof suitable for use as antigens (e.g., for epitope 
mapping) and for use as immunogens (e.g., for raising 

20 antibodies or as vaccines), fusions of PAPP-E isoform 
polypeptides and fragments to heterologous 
polypeptides, and conjugates of the proteins, 
fragments, and fusions of the present invention to 
other moieties (e.g., to carrier proteins, to 

25 f luorophores) . 

FIGS. 3, 4, and 5 present the predicted amino 
acid sequences encoded by PAPP-Ea, PAPP-Eb, and PAPP-Ec 
cDNA clones. The amino acid sequences are further 
presented, respectively, in SEQ ID Nos : 3 (full length 

30 PAPP-Ea isoform), 7 (PAPPE-Ea isoform from aa 1 - 19), 
10 (full length PAPP-Eb isoform), 12 (amino acid 
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sequence entirely within the novel exon of PAPP-Eb (aa 
1735 - 1762)), 13 (amino acid sequence of PAPP-Eb 
resulting from the frame shift (aa 1763 - 1770) ) , 14 
(amino acids present uniquely within PappE-b, due to 
5 exon insertion followed by frameshift (aa 1735 - 

1770)), 16 (full length PAPP-Ec isoform) , 18 (20 amino 
acids centered about deletion of exon 21 in PAPP-Ec (aa 
298 - 317) ) . 

Unless otherwise indicated, amino acid 

10 sequences of the proteins of the present invention were 
determined as a predicted translation from a nucleic 
acid sequence. Accordingly, any amino acid sequence 
presented herein may contain errors due to errors in 
the nucleic acid sequence, as described in detail 

15 above. Furthermore, single nucleotide polymorphisms 
(SNPs) occur frequently in eukaryotic genomes - more 
than 1.4 million SNPs have already identified in the 
human genome, International Human Genome Sequencing 
Consortium, Nature 409:860 - 921 (2001) - and the 

20 sequence determined from one individual of a species 

may differ from other allelic forms present within the 
population. Small deletions and insertions can often 
be found that do not alter the function of the protein. 

Accordingly, it is an aspect of the present 

25 invention to provide proteins not only identical in 

sequence to those described with particularity herein, 
but also to provide isolated proteins at least about 
90% identical in sequence to those described with 
particularity herein, typically at least about 91%, 

30 92%, 93%, 94%, or 95% identical in sequence to those 
decribed with particularity herein, usefully at least 
about 96%, 97%, 98%, or 99% identical in sequence to 
those described with particularity herein, and, most 
conservatively, at least about 99.5%, 99.6%, 99.7%, 
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99.8% and 99.9% identical in sequence to those 
described with particularity herein. These sequence 
variants can be naturally occurring or can result from 
human intervention by way of random or directed 
5 mutagenesis . 

For purposes herein, percent identity of two 
amino acid sequences is determined using the procedure 
of Tatiana et al., "Blast 2 sequences - a new tool for 
comparing protein and nucleotide sequences", FEMS 
10 Microbiol Lett. 174:247-250 (1999), which procedure is 
effectuated by the computer program BLAST 2 SEQUENCES, 
available online at 

http: //www. ncbi .nlm.nih.gov/blast/bl2seq/bl2 .html, 

To assess percent identity of amino acid sequences, the 

15 BLASTP module of BLAST 2 SEQUENCES is used with default 
values of (i) BLOSUM62 matrix, Henikoff et al . , Proc. 
Natl. Acad. Sci USA 89 (22) : 10915-9 (1992); (ii) open 
gap 11 and extension gap 1 penalties; and (iii) gap 
x_dropoff 50 expect 10 word size 3 filter, and both 

20 sequences are entered in their entireties. 

As is well known, amino acid substitutions 
occur frequently among natural allelic variants, with 
conservative substitutions often occasioning only de 
minimis change in protein function. 

25 Accordingly, it is an aspect of the present 

invention to provide proteins not only identical in 
sequence to those described with particularity herein, 
but also to provide isolated proteins having the 
sequence of PAPP-E proteins, or portions thereof, with 

30 conservative amino acid substitutions, and to provide 
isolated proteins having the sequence of PAPP-E 
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proteins, and portions thereof, with moderately 
conservative amino acid substitutions. These 
conservatively-sustituted or moderately conservatively- 
substituted variants can be naturally occurring or can 
5 result from human intervention. 

Although there are a variety of metrics for 
calling conservative amino acid substitutions, based 
primarily on either observed changes among 
evolutionarily related proteins or on predicted 
10 chemical similarity, for purposes herein a conservative 
replacement is any change having a positive value in 
the PAM250 log-likelihood matrix reproduced herein 
below (see Gonnet et al., Science 256 (5062) : 1443-5 
(1992) ) : 
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V 0 -2 -2 -3 0 -2 -2-3-232-220 -2 -1 0-3-1 3 

For purposes herein, a "moderately conservative" 
replacement is any change having a nonnegative value in 
the PAM250 log-likelihood matrix reproduced herein 
5 above . 

As is also well known in the art, relatedness 
of proteins can also be characterized using a 
functional test, the ability of the encoding nucleic 
acids to base-pair to one another at defined 

10 hybridization stringencies. 

It is, therefore, another aspect of the 
invention to provide isolated proteins not only 
identical in sequence to those described with 
particularity herein, but also to provide isolated 

15 proteins ("hybridization related proteins") that are 
encoded by nucleic acids that hybridize under high 
stringency conditions (as defined herein above) to all 
or to a portion of various of the isolated PAPP-E 
nucleic acids of the present invention ("reference 

20 nucleic acids") . It is a further aspect of the 

invention to provide isolated proteins ("hybridization 
related proteins") that are encoded by nucleic acids 
that hybridize under moderate sringency conditions (as 
defined herein above) to all or to a portion of various 

25 of the isolated PAPP-E nucleic acids of the present 
invention. 

The hybridization related proteins can be 
alternative isoforms, homologues, paralogues, and 
orthologues of the PAPP-E proteins of the present 
30 invention. Particularly preferred orthologues are 

those from other primate species, such as chimpanzee, 
rhesus macaque, baboon, and gorilla, from rodents, such 
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as rats, mice, guinea pigs, and from livestock, such as 
cow, pig, sheep, horse, goat, 

Relatedness of proteins can also be 
characterized using a second functional test, the 
5 ability of a first protein competitively to inhibit the 
binding of a second protein to an antibody. 

It is, therefore, another aspect of the 
present invention to provide isolated proteins not only 
identical in sequence to those described with 

10 particularity herein, but also to provide isolated 

proteins ("cross-reactive proteins") that competitively 
inhibit the binding of antibodies to all or to a 
portion of various of the isolated PAPP-E proteins of 
the present invention ("reference proteins") . Such 

15 competitive inhibition can readily be determined using 
immunoassays well known in the art. 

Among the proteins of the present invention 
that differ in amino acid sequence from those described 
with particularity herein - including those that have 

20 deletions and insertions causing up to 10% non- 
identity, those having conservative or moderately 
conservative substitutions, hybridization related 
proteins, and cross-reactive proteins — those that 
substantially retain one or more PAPP-E activities are 

25 preferred. As described above, those activities 
include metalloprotease activity, specifically an 
ability to cleave an IGFBF, ability to 
heteromultimerize with serum proteins, such as 
eosinophil major basic protein (proMBP) , and the 

30 ability to control survival, growth, and/or 

differentiation of the dominant ovarian follicle. 

Residues that are tolerant of change while 
retaining function can be identified by altering the 
protein at known residues using methods known in the 
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art, such as alanine scanning mutagenesis, Cunningham 
et al., Science 244 (4908) : 1081-5 (1989); transposon 
linker scanning mutagenesis, Chen et al., Gene 
263 (1-2) : 39-48 (2001); combinations of homolog- and 
5 alanine-scanning mutagenesis, Jin et al . , J. Mol . Biol. 
226 (3) : 851-65 (1992); combinatorial alanine scanning, 
Weiss et al . , Proc. Natl. Acad. Sci USA 97 (16) : 8950-4 
(2000), followed by functional assay. 
Transposon linker scanning kits are available 

10 commercially (New England Biolabs, Beverly, MA, USA, 

catalog, no. E7-102S; EZ::TN™ In-Frame Linker Insertion 
Kit, catalogue no. EZI04KN, Epicentre Technologies 
Corporation, Madison, WI, USA) . 

As further described below, the isolated 

15 proteins of the present invention can readily be used 
as specific immunogens to raise antibodies that 
specifically recognize PAPP-E proteins, their isoforms, 
homologues, paralogues, and/or orthologues. The 
antibodies, in turn, can be used, Inter alia, 

20 specifically to assay for the PAPP-E proteins of the 
present invention — e.g. by EL ISA for detection of 
protein fluid samples, such as serum, by 
immunohistochemistry or laser scanning cytometry, for 
detection of protein in tissue samples, or by flow 

25 cytometry, for detection of intracellular protein in 
cell suspensions — for specific antibody-mediated 
isolation and/or purification of PAPP-E proteins, as 
for example by immunoprecipitation, and for use as 
specific agonists or antagonists of PAPP-E action. 

30 The isolated proteins of the present 

invention are also immediately available for use as 
specific standards in assays used to determine the 
concentration and/or amount specifically of the PAPP-E 
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proteins of the present invention. For example, ELISA 
kits for detection and quantitation of protein analytes 
include purified protein of known concentration for use 
as a measurement standard (e.g., the human inter feron-y 
5 OptEIA kit, catalog no. 555142, Pharmingen, San Diego, 
CA, USA includes human recombinant gamma interferon, 
baculovirus produced) . 

The isolated proteins of the present 
invention are also immediately available for use as 

10 specific biomolecule capture probes for surface- 
enhanced laser desorption ionization (SELDI) detection 
of protein-protein interactions, WO 98/59362; 
WO 98/59360; WO 98/59361; and Merchant et al . , 
Electrophoresis 21 ( 6) : 11 64-77 (2000), the disclosures 

15 of which are incorporated herein by reference in their 
entireties. The isolated proteins of the present 
invention are also immediately available for use as 
specific biomolecule capture probes on BIACORE surface 
plasmon resonance probes. 

20 The isolated proteins of the present 

invention are also useful as a therapeutic supplement 
in patients having a specific deficiency in PAPP-E 
production. 

In another aspect, the invention also 

25 provides fragments of various of the proteins of the 
present invention. The protein fragments are useful, 
inter alia, as antigenic and immunogenic fragments of a 
PAPP-E isoform. 

By "fragments" of a protein is here intended 

30 isolated proteins (equally, polypeptides, peptides, 
oligopeptides), however obtained, that have an amino 
acid sequence identical to a portion of the reference 
amino acid sequence, which portion is at least 6 amino 
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acids and less than the entirety of the reference 
nucleic acid. As so defined, "fragments" need not be 
obtained by physical fragmentation of the reference 
protein, although such provenance is not thereby 
5 precluded. 

Fragments of at least 6 contiguous amino 
acids are useful in mapping B cell and T cell epitopes 
of the reference protein. See, e.g., Geysen et al., 
"Use of peptide synthesis to probe viral antigens for 

10 epitopes to a resolution of a single amino acid, " Proc. 
Natl. Acad. Sci. USA 81:3998-4002 (1984) and U.S. Pat. 
Nos. 4,708,871 and 5,595,915, the disclosures of which 
are incorporated herein by reference in their 
entireties. Because the fragment need not itself be 

15 immunogenic, part of an immunodominant epitope, nor 
even recognized by native antibody, to be useful in 
such epitope mapping, all fragments of at least 6 amino 
acids of the proteins of the present invention have 
utility in such a study. 

2 0 Fragments of at least 8 contiguous amino 

acids, often at least 15 contiguous amino acids, have 
utility as immunogens for raising antibodies that 
recognize the proteins of the present invention. See, 
e.g., Lerner, "Tapping the immunological repertoire to 

25 produce antibodies of predetermined specificity, " 

Nature 299:592-596 (1982); Shinnick et al . , "Synthetic 
peptide immunogens as vaccines," Annu. Rev. Microbiol. 
37:425-46 (1983); Sutcliffe et a J . , "Antibodies that 
react with predetermined sites on proteins," Science 

30 219:660-6 (1983), the disclosures of which are 

incorporated herein by reference in their entireties. 
As further described in the above-cited references, 
virtually all 8-mers, conjugated to a carrier, such as 



- 101 - 

a protein, prove immunogenic — that is, prove capable 
of eliciting antibody for the conjugated peptide; 
accordingly, all fragments of at least 8 amino acids of 
the proteins of the present invention have utility as 
5 immunogens . 

Fragments of at least 8, 9, 10 or 12 
contiguous amino acids are also useful as competitive 
inhibitors of binding of the entire protein, or a 
portion thereof, to antibodies (as in epitope mapping) , 

10 and to natural binding partners, such as subunits in a 
multerimic complex or to receptors or ligands of the 
subject protein; this competitive inhibition permits 
identification and separation of molecules that bind 
specifically to the protein of interest, U.S. Pat. Nos. 

15 5,539,084 and 5,783,674, incorporated herein by 
reference in their entireties. 

The protein, or protein fragment, of the 
present invention is thus at least 6 amino acids in 
length, typically at least 8, 9, 10 or 12 amino acids 

20 in length, and often at least 15 amino acids in length. 
Often, the protein or the present invention, or 
fragment thereof, is at least 20 amino acids in length, 
even 25 amino acids, 30 amino acids, 35 amino acids, or 
50 amino acids or more in length. Of course, larger 

25 fragments having at least 75 amino acids, 100 amino 

acids, or even 150 amino acids are also useful, and at 
times preferred. 

The present invention further provides 
fusions of the proteins and protein fragments of the 

30 present invention to heterologous polypeptides. 

By fusion is here intended that the protein 
or protein fragment of the present invention is 
linearly contiguous to the heterologous polypeptide in 
a peptide-bonded polymer of amino acids or amino acid 
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analogues; by "heterologous polypeptide" is here 
intended a polypeptide that does not naturally occur in 
contiguity with the protein or protein fragment of the 
present invention. As so defined, the fusion can 
5 consist entirely of a plurality of fragments of the 
PAPP-E protein in altered arrangement; in such case, 
any of the PAPP-E fragments can be considered 
heterologous to the other PAPP-E fragments in the 
fusion protein. More typically, however, the 

10 heterologous polypeptide is not drawn from the PAPP-E 
protein itself. 

The fusion proteins of the present invention 
will include at least one fragment of the protein of 
the present invention, which fragment is at least 6, 

15 typically at least 8, often at least 15, and usefully 
at least 16, 17, 18, 19, or 20 amino acids long. The 
fragment of the protein of the present to be included 
in the fusion can usefully be at least 25 amino acids 
long, at least 50 amino acids long, and can be at least 

20 75, 100, or even 150 amino acids long. Fusions that 
include the entirety of the proteins of the present 
invention have particular utility. 

The heterologous polypeptide included within 
the fusion protein of the present invention is at least 

25 6 amino acids in length, often at least 8 amino acids 
in length, and usefully at least 15, 20, and 25 amino 
acids in length. Fusions that include larger 
polypeptides, such as the IgG Fc region, and even 
entire proteins (such as GFP chromophore-containing 

30 proteins), have particular utility. 

As described above in the description of 
vectors and expression vectors of the present 
invention, which discussion is incorporated herein by 
reference in its entirety, heterologous polypeptides to 
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be included in the fusion proteins of the present 
invention can usefully include those designed to 
facilitate purification and/or visualization of 
recombinantly-expressed proteins. Although 
5 purification tags can also be incorporated into fusions 
that are chemically synthesized, chemical synthesis 
typically provides sufficient purity that further 
purification by HPLC suffices; however, visualization 
tags as above described retain their utility even when 

10 the protein is produced by chemical synthesis, and when 
so included render the fusion proteins of the present 
invention useful as directly detectable markers of 
PAPP-E presence. 

As also discussed above, heterologous 

15 polypeptides to be included in the fusion proteins of 
the present invention can usefully include those that 
facilitate secretion of recombinantly expressed 
proteins — into the periplasmic space or extracellular 
milieu for prokaryotic hosts, into the culture medium 

20 for eukaryotic cells — through incorporation of 
secretion signals and/or leader sequences. 

Other useful protein fusions of the present 
invention include those that permit use of the protein 
of the present invention as bait in a yeast two-hybrid 

25 system. See Bartel et a J . (eds.), The Yeast Two-Hvbrid 
System , Oxford University Press (1997) (ISBN: 
0195109384); Zhu et al . , Yeast Hybrid Technologies , 
Eaton Publishing, (2000) (ISBN 1-881299-15-5); Fields 
et al., Trends Genet. 10(8):286-92 (1994); Mendelsohn 

30 et al., Curr. Opin. Biotechnol. 5(5): 482-6 (1994); 
Luban et al., Curr. Opln. Biotechnol. 6(l):59-64 
(1995) ; . Allen et al . , Trends Biochem. Sci. 20(12): 511-6 
(1995); Drees, Curr. Opin. Chem. Biol. 3(l):64-70 
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(1999); Topcu et al . , Pharm. Res. 17 ( 9) : 1049-55 (2000); 
Fashena et al., Gene 250 (1-2) : 1-14 (2000), the 
disclosures of which are incorporated herein by 
reference in their entireties. Typically, such fusion 
5 is to either E. coli LexA or yeast GAL4 DNA binding 
domains. Related bait plasmids are available that 
express the bait fused to a nuclear localization 
signal . 

Other useful protein fusions include those 
10 that permit display of the encoded protein on the 

surface of a phage or cell, fusions to intrinsically 
fluorescent proteins, such as green fluorescent protein 
(GFP) , and fusions to the IgG Fc region. 

The proteins and protein fragments of the 
15 present invention can also usefully be fused to protein 
toxins, such as Pseudomonas exotoxin A, diphtheria 
toxin, shiga toxin A, anthrax toxin lethal factor, 
ricin, in order to effect ablation of cells that bind 
or take up the proteins of the present invention. 
20 The isolated proteins, protein fra:gments, and 

protein fusions of the present invention can be 
composed of natural amino acids linked by native 
peptide bonds, or can contain any or all of nonnatural 
amino acid analogues, nonnative bonds, and post- 
25 synthetic (post translational ) modifications, either 
throughout the length of the protein or localized to 
one or more portions thereof. 

As is well known in the art, when the 
isolated protein is used, e.g., for epitope mapping, 
30 the range of such nonnatural analogues, nonnative 

inter-residue bonds, or post-synthesis modifications 
will be limited to those that permit binding of the 
peptide to antibodies. When used as an immunogen for 
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the preparation of antibodies in a non-human host, such 
as a mouse, the range of such nonnatural analogues, 
nonnative inter-residue bonds, or post-synthesis 
modifications will be limited to those that do not 
5 interfere with the immunogenicity of the protein. When 
the isolated protein is used as a therapeutic agent, 
such as a vaccine or for replacement therapy, the range 
of such changes will be limited to those that do not 
confer toxicity upon the isolated protein. 
10 Non-natural amino acids can be incorporated 

during solid phase chemical synthesis or by recombinant 
techniques, although the former is typically more 
common . 

For example, D-enantiomers of natural amino 

15 acids can readily be incorporated during chemical 
peptide synthesis: peptides assembled from D-amino 
acids are more resistant to proteolytic attack; 
incorporation of D-enantiomers can also be used to 
confer specific three dimensional conformations on the 

20 peptide. Other amino acid analogues commonly added 
during chemical synthesis include ornithine, 
norleucine, phosphorylated amino acids (typically 
phosphoserine, phosphothreonine, phosphotyrosine) , 
L-malonyltyrosine, a non-hydrolyzable analog of 

25 phosphotyrosine (Kole et al. t Biochem. Biophys. Res. 
Com. 209:817-821 (1995)), and various halogenated 
phenylalanine derivatives. 

Amino acid analogues having detectable labels 
are also usefully incorporated during synthesis to 

30 provide a labeled polypeptide. 

Biotin, for example, can be added using 
biotinoyl — (9-f luorenylmethoxycarbonyl ) -L-lysine (FMOC 
biocytin) (Molecular Probes, Eugene, OR, USA) . The 
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FMOC and tBOC derivatives of dabcyl-L-lysine (Molecular 
Probes, Inc., Eugene, OR, USA) can be used to 
incorporate the dabcyl chromophore at selected sites in 
the peptide sequence during synthesis. The 
5 aminonaphthalene derivative EDANS, the most common 
fluorophore for pairing with the dabcyl quencher in 
fluorescence resonance energy transfer (FRET) systems, 
can be introduced during automated synthesis of 
peptides by using EDANS — FMOC-L-glutamic acid or the 

10 corresponding tBOC derivative (both from Molecular 

Probes, Inc., Eugene, OR, USA). Tetramethylrhodamine 
fluorophores can be incorporated during automated FMOC 
synthesis of peptides using (FMOC) — TMR-L-lysine 
(Molecular Probes, Inc. Eugene, OR, USA) . 

15 Other useful amino acid analogues that can be 

incorporated during chemical synthesis include aspartic 
acid, glutamic acid, lysine, and tyrosine analogues 
having allyl side-chain protection (Applied Biosystems, 
Inc., Foster City, CA, USA); the allyl side chain 

20 permits synthesis of cyclic, branched-chain, 

sulfonated, glycosylated, and phosphorylated peptides. 

A large number of other FMOC-protected non- 
natural amino acid analogues capable of incorporation 
during chemical synthesis are available commercially, 

25 including, e.g., Fmoc-2-aminobicyclo [2.2.1] heptane-2- 
carboxylic acid, Fmoc-3-endo- 

aminobicyclo [2.2.1] heptane-2-endo-carboxylic acid, 
Fmoc-3-exo-aminobicyclo [2.2. 1 ] heptane-2-exo-carboxylic 
acid, Fmoc-3-endo-amino-bicyclo [2 .2.1] hept-5-ene-2- 
30 endo-carboxylic acid, Fmoc-3-exo-amino- 

bicyclo [2.2.1] hept-5-ene-2-exo-carboxylic acid, Fmoc- 
cis-2- amino- 1-cyclohexanecarboxylic acid, Fmoc-trans-2- 
amino-l-cyclohexanecarboxylic acid, Fmoc-1 -amino- 1- 
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cyclopentanecarboxylic acid, Fmoc-cis-2-amino-l- 
cyclopentanecarboxylic acid, Fmoc- 1- amino- 1- 
cyclopropanecarboxylic acid, Fmoc-D-2-amino-4- 
(ethylthio) butyric acid, Fmoc-L-2-amino-4- 
5 (ethylthio) butyric acid, Fmoc-L-buthionine, Fmoc-S- 
methyl-L-Cysteine, Fmoc-2-aminobenzoic acid 
(anthranillic acid) , Fmoc-3-aminobenzoic acid, Fmoc-4- 
aminobenzoic acid, Fmoc-2-aminobenzophenone-2 1 - 
carboxylic acid, Fmoc-N- ( 4-aminobenzoyl) -b-alanine, 

10 Fmoc- 2 -amino- 4 , 5-dimethoxybenzoic acid, Fmoc- 4- 

aminohippuric acid, Fmoc-2 -amino- 3-hydroxybenzoic acid, 
Fmoc- 2 -amino- 5-hydroxybenzoic acid, Fmoc- 3 -amino- 4- 
hydroxybenzoic acid, Fmoc-4-amino-3-hydroxybenzoic 
acid, Fmoc- 4 -amino- 2 -hydroxybenzoic acid, Fmoc- 5- amino - 

15 2 -hydroxybenzoic acid, Fmoc-2 -amino-3-methoxybenzoic 

acid, Fmoc- 4 -amino- 3-methoxybenzoic acid, Fmoc-2 -amino- 
3-methylbenzoic acid, Fmoc-2-amino-5-methylbenzoic 
acid, Fmoc- 2 -amino- 6-methylbenzoic acid, Fmoc-3-amino- 
2-methylbenzoic acid, Fmoc-3-amino-4-methylbenzoic 

20 acid, Fmoc-4-amino-3-methylbenzoic acid, Fmoc-3-amino- 
2-naphtoic acid, Fmoc-D, L-3-amino-3-phenylpropionic 
acid, Fmoc-L-Methyldopa, Fmoc-2-amino-4 , 6-dimethyl-3- 
pyridinecarboxylic acid, Fmoc-D, L-?-amino-2- 
thiophenacetic acid, Fmoc-4- (carboxymethyl) piperazine, 

25 Fmoc-4-carboxypiperazine, Fmoc-4- 

(carboxymethyl) homopiperazine, Fmoc-4-phenyl-4- 
piperidinecarboxylic acid, Fmoc-L-1, 2, 3, 4- 
tetrahydronorharman-3-carboxylic acid, Fmoc-L- 
thiazolidine-4-carboxylic acid, all available from The 

30 Peptide Laboratory (Richmond, CA, USA) . 

Non-natural residues can also be added 
biosynthetically by engineering a suppressor tRNA, 
typically one that recognizes the UAG stop codon, by 
chemical aminoacylation with the desired unnatural 
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amino acid and. Conventional site-directed mutagenesis 
is used to introduce the chosen stop codon UAG at the 
site of interest in the protein gene. When the 
acylated suppressor tRNA and the mutant gene are 
5 combined in an in vitro transcription/ translation 
system, the unnatural amino acid is incorporated in 
response to the UAG codon to give a protein containing 
that amino acid at the specified position. Liu et a J . , 
Proc. Natl Acad. Sci. USA 96(9):4780-5 (1999). 

10 The isolated proteins, protein fragments and 

fusion proteins of the present invention can also 
include nonnative inter-residue bonds, including bonds 
that lead to circular and branched forms. 

The isolated proteins and protein fragments 

15 of the present invention can also include post- 

translational and post-synthetic modifications, either 
throughout the length of the protein or localized to 
one or more portions thereof. 

For example, when produced by recombinant 

20 expression in eukaryotic cells, the isolated proteins, 
fragments, and fusion proteins of the present invention 
will typically include N-linked and/or O-linked 
glycosylation, the pattern of which will reflect both 
the availability of glycosylation sites on the protein 

25 sequence and the identity of the host cell. Further 

modification of glycosylation pattern can be performed 
enzymatically . 

As another example, recombinant polypeptides 
of the invention may also include an initial modified 

30 methionine residue, in some cases resulting from host- 
mediated processes. 

When the proteins, protein fragments, and 
protein fusions of the present invention are produced 
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by chemical synthesis, post-synthetic modification can 
be performed before deprotection and cleavage from the 
resin or after deprotection and cleavage. Modification 
before deprotection and cleavage of the synthesized 
5 protein often allows greater control, e.g. by allowing 
targeting of the modifying moiety to the N-terminus of 
a resin-bound synthetic peptide. 

Useful post-synthetic (and post- 
translational) modifications include conjugation to 
10 detectable labels, such as f luorophores . 

A wide variety of amine-reactive and thiol- 
reactive fluorophore derivatives have been synthesized 
that react under nondenaturating conditions with N- 
terminal amino groups and epsilon amino groups of 
15 lysine residues, on the one hand, and with free thiol 
groups of cysteine residues, on the other. 

Kits are available commercially that permit 
conjugation of proteins to a variety of amine-reactive 
or thiol-reactive f luorophores : Molecular Probes, Inc. 
20 (Eugene, OR, USA), e.g., offers kits for conjugating 
proteins to Alexa Fluor 350, Alexa Fluor 430, 
Fluorescein-EX, Alexa Fluor 488, Oregon Green 488, 
Alexa Fluor 532, Alexa Fluor 546, Alexa Fluor 546, 
Alexa Fluor 568, Alexa Fluor 594, and Texas Red-X. 
25 A wide variety of other amine-reactive and 

thiol-reactive fluorophores are available commercially 
(Molecular Probes, Inc., Eugene, OR, USA), including 
Alexa Fluor® 350, Alexa Fluor® 488, Alexa Fluor® 532, 
Alexa Fluor® 546, Alexa Fluor® 568, Alexa Fluor® 594, 
30 Alexa Fluor® 647 (monoclonal antibody labeling kits 
available from Molecular Probes, Inc., Eugene, OR, 
USA), BODIPY dyes, such as BODIPY 493/503, BODIPY FL, 
BODIPY R6G, BODIPY 530/550, BODIPY TMR, BODIPY 558/568, 
BODIPY 558/568, BODIPY 564/570, BODIPY 576/589, BODIPY 
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581/591, BODIPY TR, BODIPY 630/650, BODIPY 650/665, 
Cascade Blue, Cascade Yello, Dansyl, lissamine 
rhodamine B, Marina Blue, Oregon Green 488, Oregon 
Green 514, Pacific Blue, rhodamine 6G, rhodamine green, 
rhodamine red, tetramethylrhodamine, Texas Red 
(available from Molecular Probes, Inc., Eugene, OR, 
USA) . 

The polypeptides of the present invention can 
also be conjugated to f luorophores, other proteins, and 
other macromolecules, using bifunctional linking 
reagents . 

Common homobi functional reagents include, 
e.g., APG, AEDP, BASED, BMB, BMDB, BMH, BMOE, BM[PEO]3, 
BM[PEO]4, BS3, BSOCOES, DFDNB, DMA, DMP, DMS, DPDPB, 
DSG, DSP (Lomant's Reagent), DSS, DST, DTBP, DTME, 
DTSSP, EGS, HBVS, Sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS 
(all available Pierce, Rockford, IL, USA) ; common 
heterobifunctional cross-linkers include ABH, AMAS, 
ANB-NOS, APDP, ASBA, BMPA, BMPH, BMPS, EDC, EMC A, EMCH, 
EMCS, KMUA, KMUH, GMBS, LC-SMCC, LC-SPDP, MBS, M2C2H, 
MPBH, MSA, NHS -ASA, PDPH, PMPI, SADP, SAED, SAND, 
SANPAH, SASD, SATP, SBAP, SFAD, SIA, SIAB, SMCC, SMPB, 
SMPH, SMPT, SPDP, Sulfo-EMCS, Sulfo-GMBS, Sulfo-HSAB, 
Sulfo-KMUS, Sulfo-LC-SPDP, Sulfo-MBS, Sulf o-NHS-LC-ASA, 
Sulfo-SADP, Sulfo-SANPAH, Sulfo-SIAB, Sulfo-SMCC, 
Sulfo-SMPB, Sulfo-LC-SMPT, SVSB, TFCS (all available 
Pierce, Rockford, IL, USA) . 

The proteins, protein fragments, and protein 
fusions of the present invention can be conjugated, 
using such cross-linking reagents, to fluorophores that 
are not amine- or thiol-reactive. 

Other labels that usefully can be conjugated 
to the proteins, protein fragments, and fusion proteins 
of the present invention include radioactive labels, 



- Ill - 



echosonographic contrast reagents, and MR I contrast 
agents . 

The proteins, protein fragments, and protein 
fusions of the present invention can also usefully be 
5 conjugated using cross-linking agents to carrier 

proteins, such as KLH, bovine thyroglobulin, and even 
bovine serum albumin (BSA) , to increase immunogenicity 
for raising anti-PAPP-E antibodies . 

The proteins, protein fragments, and protein 

10 fusions of the present invention can also usefully be 
conjugated to polyethylene glycol (PEG); PEGylation 
increases the serum half life of proteins administered 
intravenously for replacement therapy. Delgado et al . , 
Crit. Rev. Ther. Drug Carrier Syst. 9 (3-4 ): 249-304 

15 (1992); Scott et al . , Curr. Pharm. Des . 4(6):423-38 
(1998); DeSantis et al . , Curr. Opin. Biotechnol . 
10(4): 324-30 (1999), incorporated herein by reference 
in their entireties. PEG monomers can be attached to 
the protein directly or through a linker, with 

20 PEGylation using PEG monomers activated with tresyl 
chloride (2,2, 2-trif luoroethanesulphonyl chloride) 
permitting direct attachment under mild conditions. 

The isolated proteins of the present 
invention, including fusions thereof, can be produced 

25 by recombinant expression, typically using the 

expression vectors of the present invention as above- 
described or, if fewer than about 100 amino acids, by 
chemical synthesis (typically, solid phase synthesis) , 
and, on occasion, by in vitro translation. 

30 Production of the isolated proteins of the 

present invention can optionally be followed by 
purification. 
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Purification of recombinantly expressed 
proteins is now well within the skill in the art. See, 
e.g., Thorner et al . (eds.), Applications of Chimeric 
Genes and Hybrid Protei ns, Part A: Gene Expression and 
Protein Purification (Methods in Enzymology, Volume 
326), Academic Press (2000), (ISBN: 0121822273); Harbin 
(ed«)/ Cloning , Gene Expression and Protein 
Purification : Experim ental Procedures and Process 
Rationale, Oxford Univ. Press (2001) (ISBN: 
0195132947); Marshak et al., Strategies for Protein 
Purification and Char acterization: A Laboratory Course 
Manual, Cold Spring Harbor Laboratory Press (1996) 
(ISBN: 0-87969-385-1); and Roe (ed.), Protein 
Purification Ap plications . Oxford University Press 
(2001) , the disclosures of which are incorporated 
herein by reference in their entireties, and thus need 
not be detailed here. 

Briefly, however, if purification tags have 
been fused through use of an expression vector that 
appends such tag, purification can be effected, at 
least in part, by means appropriate to the tag, such as 
use of immobilized metal affinity chromatography for 
polyhistidine tags. Other techniques common in the art 
include ammonium sulfate fractionation, 

immunoprecipitation, fast protein liquid chromatography 
(FPLC), high performance liquid chromatography (HPLC) , 
and preparative gel electrophoresis. 

Purification of chemically-synthesized 
peptides can readily be effected, e.g., by HPLC. 

Accordingly, it is an aspect of the present 
invention to provide the isolated proteins of the 
present invention in pure or substantially pure form. 
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A purified protein of the present invention 
is an isolated protein, as above described, that is 
present at a concentration of at least 95%, as measured 
on a mass basis with respect to total protein in a 
5 composition. Such purities can often be obtained 

during chemical synthesis without further purification, 
as, e.g., by HPLC. Purified proteins of the present 
invention can be present at a concentration (measured 
on a mass basis with respect to total protein in a 

10 composition) of 96%, 97%, 98%, and even 99%. The 

proteins of the present invention can even be present 
at levels of 99.5%, 99.6%, and even 99.7%, 99.8%, or 
even 99.9% following purification, as by HPLC. 

Although high levels of purity are preferred 

15 when the isolated proteins of the present invention are 
used as therapeutic agents — such as vaccines, or for 
replacement therapy — the isolated proteins of the 
present inventiqn are also useful at lower purity. For 
example, partially purified proteins of the present 

20 invention can be used as immunogens to raise antibodies 
in laboratory animals. 

Thus, in another aspect, the present 
invention provides the isolated proteins of the present 
invention in substantially purified form. A 

25 "substantially purified protein" of the present 

invention is an isolated protein, as above described, 
present at a concentration of at least 70%, measured on 
a mass basis with respect to total protein in a 
composition. Usefully, the substantially purified 

30 protein is present at a concentration, measured on a 
mass basis with respect to total protein in a 
composition, of at least 75%, 80%, or even at least 
85%, 90%, 91%, 92%, 93%, 94%, 94.5% or even at least 
94.9%. 
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In preferred embodiments, the purified and 
substantially purified proteins of the present 
invention are in compositions that lack detectable 
ampholytes, acrylamide monomers, bis-acrylamide 
5 monomers, and polyacrylamide . 

The proteins, fragments, and fusions of the 
present invention can usefully be attached to a 
substrate. The substrate can porous or solid, planar 
or non-planar; the bond can be covalent or noncovalent. 

10 For example, the proteins, fragments, and 

fusions of the present invention can usefully be bound 
to a porous substrate, commonly a membrane, typically 
comprising nitrocellulose, polyvinylidene fluoride 
(PVDF) , or cationically derivatized, hydrophilic PVDF; 

15 so bound, the proteins, fragments, and fusions of the 
present invention can be used to detect and quantify 
antibodies, e.g. in serum, that bind specifically to 
the immobilized protein of the present invention. 

As another example, the proteins, fragments, 

20 and fusions of the present invention can usefully be 
bound to a substantially nonporous substrate, such as 
plastic, to detect and quantify antibodies, e.g. in 
serum, that bind specifically to the immobilized 
protein of the present invention. Such plastics 

25 include polymethylacrylic, polyethylene, polypropylene, 
polyacrylate, polymethylmethacrylate, 
polyvinylchloride, polytetraf luoroethylene, 
polystyrene, polycarbonate, polyacetal, polysulfone, 
celluloseacetate, eel lulosenit rate, nitrocellulose, or 

30 mixtures thereof; when the assay is performed in 

standard microtiter dish, the plastic is typically 
polystyrene. 

The proteins, fragments, and fusions of the 
present invention can also be attached to a substrate 
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suitable for use as a surface enhanced laser desorption 
ionization source; so attached, the protein, fragment, 
or fusion of the present invention is useful for 
binding and then detecting secondary proteins that bind 
with sufficient affinity or avidity to the surface- 
bound protein to indicate biologic interaction 
therebetween. The proteins, fragments, and fusions of 
the present invention can also be attached to a 
substrate suitable for use in surface plasmon resonance 
detection; so attached, the protein, fragment, or 
fusion of the present ivnention is useful for binding 
and then detecting secondary proteins that bind with 
sufficient affinity or avidity to the surface-bound 
protein to indicate biological interaction 
therebetween . 

PAPP-E Isoform Proteins 

In a first series of protein embodiments, the 
invention provides an isolated PAPP-E polypeptide 
20 having an amino acid sequence encoded by the cDNA in 

ATCC Deposit No. , or the amino acid sequence in 

SEQ ID NO: 3, which are full length human PAPP-Ea 
isoforms. The invention further provides isolated 
PAPP-E polypeptides having an amino acid sequence 

25 encoded by the cDNA in ATCC Deposit No. , or the 

amino acid sequence in SEQ ID NO: 10, which are full 
length human PAPP-Eb isoforms. The invention also 
provides isolated PAPP-E polypeptides having an amino 
acid sequence encoded by the cDNA in ATCC Deposit No. 

30 , or the amino acid sequence in SEQ ID NO: 16, 

which are full length human PAPP-Ec isoforms. 

When used as immunogens, the full length 
proteins of the present invention can be used, inter 
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alia, to elicit antibodies that bind to epitopes that 
are common to all known PAPP-E isoforms. Such epitopes 
are encoded by any of exons 2 - 20, and by that portion 
of exon 1 translated in the PAPP-Ef isoform. When such 
5 antibodies are used for analytical assay of PAPP-E — 
e.g., in an ELISA intended to report the presence 
and/or amount of all isoforms, without distinction 
thereamong — any of the full length proteins can be 
used as a standard. 

10 When used as immunogens, the full length 

proteins of the present invention can be used, inter 
alia, to elicit antibodies that bind to an epitope that 
is shared by PAPP-Ea, PAPP-Eb, and PAPP-Ec but absent 
from PAPP-Ef: such epitopes are encoded by that portion 

15 of exon 1 not translated in PAPP-Ef. Such antibodies 
are identified by counterscreening using PAPP-Ef 
protein. When such antibodies are used for analytical 
assay of PAPP-E — e.g., an ELISA intended to report the 
amount of PAPP-Ea, PAPP-Eb, and PAPP-Ec, but not PAPP- 

20 Ef — any of the full length proteins can be used as a 
standard. 

When used as an immunogen, the full length 
PAPP-Eb protein can be used, inter alia, as an 
immunogen to elicit antibodies that bind to an epitope 

25 unique to the PAPP-Eb isoform. Such epitopes are 

encoded by exon 21 and the translated portion of exon 
22. Such antibodies are identified by counterscreening 
using PAPP-Ea, PAPP-Ec, and/or PAPP-Ef isoforms. When 
such antibodies are used for analytical assay of PAPP- 

30 E — e.g., an ELISA intended to report the presence 
and/or amount of PAPP-Eb — the full length PAPP-Eb 
protein can be used uniquely among the isoforms as a 
standard. 



- 117 - 

The invention further provides fragments of 
the above-described polypeptides, particularly 
fragments having at least 6 amino acids, typically at 
least 8 amino acids, often at least 15 amino acids, and 
5 even the entirety of the sequence given in SEQ ID NO: 7. 
This fragment (amino acids 1 - 19 of PAPP-Ea, -Eb, and 
Ec) is common to PAPP-Ea, PAPP-Eb and PAPP-Ec isoforms 
but absent from PAPP-Ef. These protein fragments can 
thus be used to identify and/or generate antibodies 

10 that recognize PAPP-Ea, PAPP-Eb, and PAPP-Ec isoforms 
without distinction thereamong, but that do not 
recognize PAPP-Ef. 

The invention further provides fragments of 
at least 6 amino acids, typically at least 8 amino 

15 acids, often at least 15 amino acids, and even the 

entirety of the sequence given in SEQ ID NO: 12, which 
is encoded by the exon that is novel in PAPP-Eb, The 
fragments have particular utility in identifying and in 
generating antibodies that recognize epitopes unique to 

20 the PAPP-Eb isoform. 

The invention further provides fragments of 
at least 6 amino acids, typically at least 8 amino 
acids, often at least 15 amino acids, and even the 
entirety of the sequence given in SEQ ID NO: 13, the 

25 coding sequence of the PAPP-Eb isoform that results 
from frameshift relative to PAPP-Ea isoform. These 
fragments have particular utility in identifying and in 
generating antibodies that recognize epitopes unique to 
the PAPP-Eb isoform. 

30 The invention further provides fragments of 

at least 6 amino acids, typically at least 8 amino 
acids, often at least 15 amino acids, and even the 
entirety of the sequence given in SEQ ID NO: 14, the 
coding sequence present uniquely in the PAPP-Eb 
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isoforitu These fragments have particular utility in 
dientifying and in generating antibodies that recognize 
epitopes unique to the PAPP-Eb isoform. 

The invention further provides fragments of 
at least 6 amino acids, typically at least 8 amino 
acids, often at least 15 amino acids, and even the 
entirety of the sequence given in SEQ ID NO: 18, the 
twenty amino acids centered about the exon 21 deletion 
in PAPP-Ec. These fragments have particular utility in 
identifying and generating antibodies that recognize 
epitopes created in the PAPP-Ec isoform due to absence 
of exon 21. 

As described above, the invention further 
provides proteins that differ in sequence from those 
described with particularity in the above-referenced 
SEQ ID NOs., whether by way of insertion or deletion, 
by way of conservative or moderately conservative 
substitutions, as hybridization related proteins, or as 
cross-hybridizing proteins, with those that 
substantially retain a PAPP-E activity preferred. 

The invention further provides fusions of the 
proteins and protein fragments herein described to 
heterologous polypeptides, 

ANTIBODIES AND ANT I BODY- PRODUCING CELLS 

In another aspect, the invention provides 
antibodies, including fragments and derivatives 
thereof, that bind specifically to one or more of the 
PAPP-E proteins and protein fragments of the present 
invention or to one or more of the proteins and protein 
fragments encoded by the isolated PAPP-E nucleic acids 
of the present invention. The antibodies of the 
present invention specifically recognize any or all of 
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linear epitopes, discontinuous epitopes, or 
conformational epitopes of such proteins or protein 
fragments, either as present on the protein in its 
native conformation or, in some cases, as present on 
5 the proteins as denatured, as, e.g., by solubilization 
in SDS. 

In other embodiments, the invention provides 
antibodies, including fragments and derivatives 
thereof, the binding of which can be competitively 
10 inhibited by one or more of the PAPP-E proteins and 

protein fragments of the present invention, or by one 
or more of the proteins and protein fragments encoded 
by the isolated PAPP-E nucleic acids of the present 
invention. 

15 4 As used herein, the term "antibody" refers to 

a polypeptide, at least a portion of which is encoded 
by at least one immunoglobulin gene, which can bind 
specifically to a first molecular species, and to 
fragments or derivatives thereof that remain capable of 

20 such specific binding. 

By "bind specifically" and "specific binding" 
is here intended the ability of the antibody to bind to 
a first molecular species in preference to binding to 
other molecular species with which the antibody and 

25 first molecular species are admixed. An antibody is 
said specifically to "recognize" a first molecular 
species when it can bind specifically to that first 
molecular species . 

As is well known in the art, the degree to 

30 which an antibody can discriminate as among molecular 
species in a mixture will depend, in part, upon the 
conformational relatedness of the species in the 
mixture; typically, the antibodies of the present 
invention will discriminate over adventitious binding 
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to non-PAPP-E proteins by at least two-fold, more 
typically by at least 5-fold, typically by more than 
10-fold, 25-fold, 50-fold, 75-fold, and often by more 
than 100-fold, and on occasion by more than 500-fold or 
5 1000-fold. When used to detect the proteins or protein 
fragments of the present invention, the antibody of the 
present invention is sufficiently specific when it can 
be used to determine the presence of the protein of the 
present invention in human serum. 

10 Typically, the affinity or avidity of an 

antibody (or antibody multimer, as in the case of an 
IgM pentamer) of the present invention for a protein or 
protein fragment of the present invention will be at 
least about 1 x 10" 6 molar (M) , typically at least about 

15 5 x 10~ 7 M, usefully at least about 1 x 10" 7 M, with 
affinities and avidities of at least 1 x 10~ 8 M, 
5 x 10" 9 M, and 1 x 10" 10 M proving especially useful. 

The antibodies of the present invention can 
be naturally-occurring forms, such as IgG, IgM, IgD, 

20 IgE, and IgA, from any mammalian species. 

Human antibodies can, but will infrequently, 
be drawn directly from human donors or human cells. In 
such case, antibodies to the proteins of the present 
invention will typically have resulted from fortuitous 

25 immunization, such as autoimmune immunization, with the 
protein or protein fragments of the present invention. 
Such antibodies will typically, but will not 
invariably, be polyclonal. 

Human antibodies are more frequently obtained 

30 using transgenic animals that express human 

immunoglobulin genes, which transgenic animals can be 
affirmatively immunized with the protein immunogen of 
the present invention. Human Ig-transgenic mice 
capable of producing human antibodies and methods of' 
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producing human antibodies therefrom upon specific 
immunization are described, inter alia, in U.S. Patent 
Nos. 6,162,963; 6,150,584; 6,114,598; 6, 075, 181; 
5, 939,598; 5, 877, 397; 5, 874,299; 5,814,318; 5,789, 650; 
5, 770,429; 5,661,016; 5, 633,425; 5, 625,126; 5, 569,825; 
5,545,807; 5,545,806, and 5,591,669, the disclosures of 
which are incorporated herein by reference in their 
entireties. Such antibodies are typically monoclonal, 
and are typically produced using techniques developed 
for production of murine antibodies . 

Human antibodies are particularly useful, and 
often preferred, when the antibodies of the present 
invention are to be administered to human beings as in 
vivo diagnostic or therapeutic agents, since recipient 
immune response to the administered antibody will often 
be substantially less than that occasioned by 
administration of an antibody derived from another 
species, such as mouse. 

IgG, IgM, IgD, IgE and IgA antibodies of the 
present invention are also usefully obtained from other 
mammalian species, including rodents - typically mouse, 
but also rat, guinea pig, and hamster - lagomorphs, 
typically rabbits, and also larger mammals, such as 
sheep, goats, cows, and horses. In such cases, as with 
the transgenic human- antibody-producing non-human 
mammals, fortuitous immunization is not required, and 
the non-human mammal is typically affirmatively 
immunized, according to standard immunization 
protocols, with the protein or protein fragment of the 
present invention. 

As discussed above, virtually all fragments 
of 8 or more contiguous amino acids of the proteins of 
the present invention can be used effectively as 
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immunogens when conjugated to a carrier, typically a 
protein such as bovine thryoglobulin, keyhole limpet 
hemocyanin, or bovine serum albumin, conveniently using 
a bifunctional linker such as those described elsewhere 
above, which discussion is incorporated by reference 
here. 

Immunogenicity can also be conferred by 
fusion of the proteins and protein fragments of the 
present invention to other moieties. 

For example, peptides of the present 
invention can be produced by solid phase synthesis on a 
branched polylysine core matrix; these multiple 
antigenic peptides (MAPs) provide high purity, 
increased avidity, accurate chemical definition and 
improved safety in vaccine development. Tarn et al., 
Proc. Natl. Acad. Sci. USA 85:5409-5413 (1988); Posnett 
et al., J. Biol. Chem. 263, 1719-1725 (1988). 

Protocols for immunizing non-human mammals 
are well-established in the art, Harlow et al . (eds.), 
ftntifeodi^s; * Laboratory Manual, Cold Spring Harbor 
Laboratory (1998) (ISBN: 0879693142); Coligan et al. 
(eds.), Current PrgtQ£ols in Immunolo gy, John Wiley & 
Sons, inc. (2001) (ISBN: 0-471-52276-7); Zola, 
Monoclonal Antibodie s • Pre paration and Use of 
Monoclonal Antibodi ^ and Engineered Antibody 
nprivativp^ (Basic s - From Background to Bench) , 
Springer Verlag (2000) (ISBN: 0387915907), the 
disclosures of which are incorporated herein by 
reference, and often include multiple immunizations, 
either with or without adjuvants such as Freund's 
complete adjuvant and Freund's incomplete adjuvant. 

Antibodies from nonhuman mammals can be 
polyclonal or monoclonal, with polyclonal antibodies 
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having certain advantages in immunohistochemical 
detection of the proteins of the present invention and 
monoclonal antibodies having advantages in identifying 
and distinguishing particular epitopes of the proteins 
of the present invention. 

Following immunization, the antibodies of the 
present invention can be produced using any art- 
accepted technique. Such techniques are well known in 
the art, Coligan et al . (eds.), Current Protocols in 
Immunology, John Wiley & Sons, Inc. (2001) (ISBN: 
0-471-52276-7); Zola, Monoclonal Antibodies : 
Preparation an d Use of Monoclonal Antibodies and 
Engineered An tibody Derivatives (Basics: From 
Background to Bench) . Springer Verlag (2000) (ISBN: 
0387915907); Howard et al . (eds.), Basic Methods in 
Antibody Produc tion and Characterization . CRC Press 
(2000) (ISBN: 0849394457); Harlow et al . (eds.), 
Antibodies: A Laboratory Manual , Cold Spring Harbor 
Laboratory (1998) (ISBN: 0879693142); Davis (ed. ) , 
Monoclonal Anti body Protocols , Vol. 45, Humana Press 
(1995) (ISBN: 0896033082); Delves (ed.), Antibody 
Production: Ess ential Techniques , John Wiley & Son Ltd 
(1997) (ISBN: 0471970107); Kenney, Antibody Solution: 
An Antibody Met hods Manual . Chapman & Hall (1997) 
(ISBN: 0412141914), incorporated herein by reference in 
their entireties, and thus need not be detailed here. 

Briefly, however, such techniques include, 
inter alia, production of monoclonal antibodies by 
hybridomas and expression of antibodies or fragments or 
derivatives thereof from host cells engineered to 
express immunoglobulin genes or fragments thereof. 
These two methods of production are not mutually 
exclusive: genes encoding antibodies specific for the 
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proteins or protein fragments of the present invention 
can be cloned from hybridomas and thereafter expressed 
in other host cells. Nor need the two necessarily be 
performed together: e.g., genes encoding antibodies 
5 specific for the proteins and protein fragments of the 
present invention can be cloned directly from B cells 
known to be specific for the desired protein, as 
further described in U.S. Pat. No. 5,627,052, the 
disclosure of which is incorporated herein by reference 

10 in its entirety, or from antibody-displaying phage. 

Recombinant expression in host cells is 
particularly useful when fragments or derivatives of 
the antibodies of the present invention are desired. 
Host cells for recombinant antibody 

15 production — either whole antibodies, antibody 

fragments, or antibody derivatives — can be prokaryotic 
or eukaryotic . 

Prokaryotic hosts are particularly useful for 
producing phage displayed antibodies of the present 

20 invention. 

The technology of phage-displayed antibodies, 
in which antibody variable region fragments are fused, 
for example, to the gene III protein (pill) or gene 
VIII protein (pVIII) for display on the surface of 

25 filamentous phage, such as M13, is by now well- 
established, Sidhu, Curr. Opin. Biotechnol . ll(6):610-6 
(2000); Griffiths et al . , Curr. Opin. Biotechnol. 
9(l):102-8 (1998); Hoogenboom et al., Immunotechnology, 
4(1): 1-20 (1998); Rader et a J . , Current Opinion in 

30 Biotechnology 8:503-508 (1997); Aujame et al., Human 
Antibodies 8:155-168 (1997); Hoogenboom, Trends in 
Biotechnol. 15:62-70 (1997); de Kruif et al . , 17:453- 
455 (1996); Barbas et al . , Trends in Biotechnol. 
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14:230-234 (1996); Winter et al . , Ann. Rev. Immunol. 
433-455 (1994), and techniques and protocols required 
to generate, propagate, screen (pan) , and use the 
antibody fragments from such libraries have recently 
5 been compiled, Barbas et al., Phage Display: A 

uhnr^.orv Manual , Cold Spring Harbor Laboratory Press 

(2001) (ISBN 0-87969-546-3); Kay et al . (eds.), Phage 
nisnlav of PepMHg.s and Proteins: A Laboratory Manual , 
Academic Press, Inc. (1996); Abelson et al . (eds.), 
10 rnmhinator^l Chemistry , Methods in Enzymology vol. 

267, Academic Press (May 1996), the disclosures of 

which are incorporated herein by reference in their 

entireties . 

Typically, phage-displayed antibody fragments 
15 are scFv fragments or Fab fragments; when desired, full 
length antibodies can be produced by cloning the 
variable regions from the displaying phage into a 
complete antibody and expressing the full length 
antibody in a further prokaryotic or a eukaryotic host 
20 cell. 

Eukaryotic cells are also useful for 
expression of the antibodies, antibody fragments, and 
antibody derivatives of the present invention. 

For example, antibody fragments of the 

25 present invention can be produced in Pichia pastoris, 
Takahashi et al . , Biosci. Biotechnol. Biochem. 
64 (10) : 2138-44 (2000); Freyre et al., J. Biotechnol. 
76 (2-3) : 157-63 (2000); Fischer et al., Biotechnol. 
Appl. Biochem. 30 (Pt 2):117-20 (1999); Pennell et al . , 

30 Res. Immunol. 149 ( 6) : 599-603 (1998); Eldin et al., J. 
Immunol. Methods. 201(1) :67-75 (1997); and in 
Saccharomyces cerevisiae, Frenken et al., Res. Immunol. 
149(6) : 589-99 (1998); Shusta et al., Nature Biotechnol. 



- 126 - 



16(8):773-7 (1998), the disclosures of which are 
incorporated herein by reference in their entireties. 

Antibodies, including antibody fragments and 
derivatives, of the present invention can also be 
produced in insect cells, Li et al . , Protein Expr. 
Purif. 21(1): 121-8 (2001); Ailor et al . , Biotechnol. 
Bioeng. 58 (2-3) : 196-203 (1998); Hsu et al., Biotechnol 
Prog. 13(1):96-104 (1997); Edelman et al . , Immunology 
91(l):13-9 (1997); and Nesbit et al . , J. Jjnmunol . 
Methods. 151 (1-2) :201-8 (1992), the disclosures of 
which are incorporated herein by reference in their 
entireties . 

Antibodies and fragments and derivatives 
thereof of the present invention can also be produced 
in plant cells, Giddings et al., Nature Biotechnol. 
18 (11) : 1151-5 (2000); Gavilondo et al., Biotechniques 
29(1): 128-38 (2000); Fischer et al . , J. Biol. Regul . 
Homeost. Agents 14 (2 ): 83-92 (2000); Fischer et al . , 
Biotechnol. Appl . Biochem. 30 (Pt 2):113-6 (1999); 
Fischer et al., Biol. Chem. 380 (7-8) : 825-39 (1999); 
Russell, Curr. Top. Microbiol. Immunol. 240:119-38 
(1999); and Ma et al., Plant Physiol. 109(2) :341-6 
(1995), the disclosures of which are incorporated 
herein by reference in their entireties. 

Mammalian cells useful for recombinant 
expression of antibodies, antibody fragments, and 
antibody derivatives of the present invention include 
CHO cells, COS cells, 293 cells, and myeloma cells. 

Verma et al., J. Immunol. Methods 
216 (1-2) : 165-81 (1998), review and compare bacterial, 
yeast, insect and mammalian expression systems for 
expression of antibodies. 
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Antibodies of the present invention can also 
be prepared by cell free translation, as further 
described in Merk et al., J. Biochem. (Tokyo). 
125 (2) : 328-33 (1999) and Ryabova et al., Nature 
5 Biotechnol. 15(1): 79-84 (1997), and in the milk of 

transgenic animals, as further described in Pollock et 
al., J. Immunol. Methods 231 (1-2) : 147-57 (1999), the 
disclosures of which are incorporated herein by 
reference in their entireties. 
10 The invention further provides antibody 

fragments that bind specifically to one or more of the 
proteins and protein fragments of the present 
invention, to one or more of the proteins and protein 
fragments encoded by the isolated nucleic acids of the 
15 present invention, or the binding of which can be 

competitively inhibited by one or more of the proteins 
and protein fragments of the present invention or one 
or more of the proteins and protein fragments encoded 
by the isolated nucleic acids of the present invention. 

Among such useful fragments are Fab, Fab', 
Fv, F(ab)' 2 , and single chain Fv (scFv) fragments. 
Other useful fragments are described in Hudson, Curr. 
Opin. Biotechnol. 9(4):395-402 (1998). 

It is also an aspect of the present invention 
25 to provide antibody derivatives that bind specifically 
to one or more of the proteins and protein fragments of 
the present invention, to one or more of the proteins 
and protein fragments encoded by the isolated nucleic 
acids of the present invention, or the binding of which 
can be competitively inhibited by one or more of the 
proteins and protein fragments of the present invention 
or one or more of the proteins and protein fragments 
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encoded by the isolated nucleic acids of the present 
invention. 

Among such useful derivatives are chimeric, 
primatized, and humanized antibodies; such derivatives 
are less immunogenic in human beings, and thus more 
suitable for in vivo administration, than are 
unmodified antibodies from non-human mammalian species. 

Chimeric antibodies typically include heavy 
and/or light chain variable regions (including both CDR 
and framework residues) of immunoglobulins of one 
species, typically mouse, fused to constant regions of 
another species, typically human. See, e.g., U.S. Pat. 
No. 5,807,715; Morrison et al . , Proc. Natl. Acad. Sci 
USA .81(21) :6851-5 (1984); Sharon et al., Nature 
309(5966) :364-7 (1984); Takeda et ai . , Nature 
314 (6010) : 452-4 (1985), the disclosures of which are 
incorporated herein by reference in their entireties. 
Primatized and humanized antibodies typically include 
heavy and/or light chain CDRs from a murine antibody 
grafted into a non-human primate or human antibody V 
region framework, usually further comprising a human 
constant region, Riechmann et al., Nature 
332 (6162) :323-7 (1988); Co et al., Nature 
351 (6326) :501-2 (1991); U.S. Pat. Nos. 6,054,297; 
5,821,337; 5,770,196; 5,766,886; 5,821,123; 5,869,619; 
6,180,377; 6,013,256; 5,693,761; and 6,180,370, the 
disclosures of which are incorporated herein by 
reference in their entireties. 

Other useful antibody derivatives of the 
invention include heteromeric antibody complexes and 
antibody fusions, such as diabodies (bispecific 
antibodies), single-chain diabodies, and intrabodies. 



The antibodies of the present invention, 
including fragments and derivatives thereof, can 
usefully be labeled. It is, therefore, another aspect 
of the present invention to provide labeled antibodies 
that bind specifically to one or more of the proteins 
and protein fragments of the present invention, to one 
or more of the proteins and protein fragments encoded 
by the isolated nucleic acids of the present invention, 
or the binding of which can be competitively inhibited 
by one or more of the proteins and protein fragments of 
the present invention or one or more of the proteins 
and protein fragments encoded by the isolated nucleic 
acids of the present invention. 

The choice of label depends, in part, upon 

the desired use. 

For example, when the antibodies of the 
present invention are used for immunohistochemical 
staining of tissue samples, the label can usefully be 
an enzyme that catalyzes production and local 
deposition of a detectable product. 

Enzymes typically conjugated to antibodies to 
permit their immunohistochemical visualization are well 
known, and include alkaline phosphatase, 3~ 
galactosidase, glucose oxidase, horseradish peroxidase 

(HRP), and urease. Typical substrates for production 
and deposition of visually detectable products include 

o-Nitrophenyl-beta-D-galactopyranoside (ONPG) ; 

o-Phenylenediaraine Dihydrochloride (OPD) ; p-Nitrophenyl 

Phosphate (PNPP) ; p-Nitrophenyl-beta-D- 

galactopryanoside (PNPG) ; 3 ' , 3 ' Diaminobenzidine (DAB) ; 

3-Amino-9-ethylcarbazole (AEC) ; 4-Chloro-l-naphthol 

(CN) ; 5-Bromo-4-chloro-3-indolyl-phosphate (BCIP) ; 

ABTS®; BluoGal; iodonitrotetrazolium (INT) ; nitroblue 

tetrazolium chloride (NBT) ; phenazine methosulfate 
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(PMS); phenolphthalein monophosphate (PMP) ; tetramethyl 
benzidine (TMB) ; tetranitroblue tetrazolium (TNBT) ; 
X-Gal; X-Gluc; and X-Glucoside. 

Other substrates can be used to produce 
products for local deposition that are luminescent. 
For example, in the presence of hydrogen peroxide 
(H 2 0 2 ), horseradish peroxidase (HRP) can catalyze the 
oxidation of cyclic diacylhydrazides, such as luminol. 
Immediately following the oxidation, the luminol is in 
an excited state (intermediate reaction product), which 
decays to the ground state by emitting light. Strong 
enhancement of the light emission is produced by 
enhancers, such as phenolic compounds- Advantages 
include high sensitivity, high resolution, and rapid 
detection without radioactivity and requiring only 
small amounts of antibody. See, e.g., Thorpe et al., 
Methods Enzymol. 133:331-53 (1986); Kricka et al . , J". 
Immunoassay 17(l):67-83 (1996); and Lundqvist et al . , 
J. Biolumin. Chemilumin. 10(6):353-9 (1995), the 
disclosures of which are incorporated herein by 
reference in their entireties. Kits for such enhanced 
chemiluminescent detection (ECL) are available 
commercially. 

The antibodies can also be labeled using 

colloidal gold. 

As another example, when the antibodies of 
the present invention are used, e.g., for flow 
cytometric detection, for scanning laser cytometric 
detection, or for fluorescent immunoassay, they can 
usefully be labeled with f luorophores . 

There are a wide variety of fluorophore 
labels that can usefully be attached to the antibodies 
of the present invention. 
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For flow cytometric applications, both for 
extracellular detection and for intracellular 
detection, common useful fluorophores can be 
fluorescein isothiocyanate (FITC) , allophycocyanin 
5 (APC) , R-phycoerythrin (PE) , peridinin chlorophyll 
protein (PerCP) , Texas Red, Cy3, Cy5, fluorescence 
resonance energy tandem fluorophores such as PerCP- 
Cy5.5, PE-Cy5, PE-Cy5.5, PE-Cy7, PE-Texas Red, and APC- 
Cy7. 

10 Other fluorophores include, inter alia, Alexa 

Fluor® 350, Alexa Fluor® 488, Alexa Fluor® 532, Alexa 
Fluor® 54 6, Alexa Fluor® 568, Alexa Fluor® 594, Alexa 
Fluor® 647 (monoclonal antibody labeling kits available 
from Molecular Probes, Inc., Eugene, OR, USA), BODIPY 

15 dyes, such as BODIPY 493/503, BODIPY FL, BODIPY R6G, 
BODIPY 530/550, BODIPY TMR, BODIPY 558/568, BODIPY 
558/568, BODIPY 564/570, BODIPY 576/589, BODIPY 
581/591, BODIPY TR, BODIPY 630/650, BODIPY 650/665, 
Cascade Blue, Cascade Yello, Dansyl, lissamine 

20 rhodamine B, Marina Blue, Oregon Green 488, Oregon 

Green 514, Pacific Blue, rhodamine 6G, rhodamine green, 
rhodamine red, tetramethylrhodamine, Texas Red 
(available from Molecular Probes, Inc., Eugene, OR, 
USA), and Cy2, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, all of 

25 which are also useful for f luorescently labeling the 
antibodies of the present invention. 

For secondary detection using labeled avidin, 
streptavidin, captavidin or neutravidin, the antibodies 
of the present invention can usefully be labeled with 

30 biotin. 

When the antibodies of the present invention 
are used, e.g., for western blotting applications, they 
can usefully be labeled with radioisotopes, such as 33 P, 
32 P, 35 S, 3 H, and 125 I. 
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As another example, when the antibodies of 
the present invention are used for radioimmunotherapy, 
the label can usefully be 228 Th, 227 Ac, 225 Ac, 223 Ra, 213 Bi, 
2"Pb, 212 Bi, 211 At, 203 Pb, 194 0s, 188 Re, 186 Re, 153 Sm, 149 Tb, 131 I, 
125 I, lu In, 105 Rh, "»Tc, 97 Ru, 90 Y, 90 Sr, 88 Y, 72 Se, 67 Cu, or 
47 Sc. 

As another example, when the antibodies of 
the present invention are to be used for in vivo 
diagnostic use, they can be rendered detectable by 
conjugation to MR I contrast agents, such as gadolinium 
diethylenetriaminepentaacetic acid (DTPA) , Lauffer et 
al., Radiology 207(2) : 529-38 (1998), or by 
radioisotopic labeling 

As would be understood, use of the labels 
described above is not restricted to the application as 
for which they were mentioned. 

The antibodies of the present invention, 
including fragments and derivatives thereof, can also 
be conjugated to toxins, in order to target the toxin's 
ablative action to cells that display and/or express 
the proteins of the present invention. Commonly, the 
antibody in such immunotoxins is conjugated to 
Pseudomonas exotoxin A, diphtheria toxin, shiga toxin 
A, anthrax toxin lethal factor, or ricin. See Hall 
(ed.), Tmmunotoxin Methods and P rotocols (Methods in 
Molecular Biology, Vol 166), Humana Press (2000) 
(ISBN: 0896037754) ; and Frankel et ai . (eds.), Clinical 
Applications of Immunotoxins , Springer-Verlag New York, 
Incorporated (1998) (ISBN: 3540640975) , the disclosures 
of which are incorporated herein by reference in their 
entireties, for review. 

The antibodies of the present invention can 
usefully be attached to a substrate, and it is, 
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therefore, another aspect of the invention to provide 
antibodies that bind specifically to one or more of the 
proteins and protein fragments of the present 
invention, to one or more of the proteins and protein 
fragments encoded by the isolated nucleic acids of the 
present invention, or the binding of which can be 
competitively inhibited by one or more of the proteins 
and protein fragments of the present invention or one 
or more of the proteins and protein fragments encoded 
by the isolated nucleic acids of the present invention, 

attached to a substrate. 

Substrates can be porous or nonporous, planar 

or nonplanar . 

For example, the antibodies of the present 
invention can usefully be conjugated to filtration 
media, such as NHS-activated Sepharose or CNBr- 
activated Sepharose for purposes of immunoaf f inity 

chromatography . 

For example, the antibodies of the present 
invention can usefully be attached to paramagnetic 
microspheres, typically by biotin-streptavidin 
interaction, which microsphere can then be used for 
isolation of cells that express or display the proteins 
of the present invention. As another example, the 
antibodies of the present invention can usefully be 
attached to the surface of a microtiter plate for 
ELISA. 

As noted above, the antibodies of the present 
invention can be produced in prokaryotic and eukaryotic 
cells. It is, therefore, another aspect of the present 
invention to provide cells that express the antibodies 
of the present invention, including hybridoma cells, B 
cells, plasma cells, and host cells recombinants 
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modified to express the antibodies of the present 
invention. 

In yet a further aspect, the present 
invention provides aptamers evolved to bind 
specifically to one or more of the proteins and protein 
fragments of the present invention, to one or more of 
the proteins and protein fragments encoded by the 
isolated nucleic acids of the present invention, or the 
binding of which can be competitively inhibited by one 
or more of the proteins and protein fragments of the 
present invention or one or more of the proteins and 
protein fragments encoded by the isolated nucleic acids 
of the present invention. 

PAPP-E Antibodies 

In a first series of antibody embodiments, 
the invention provides antibodies, both polyclonal and 
monoclonal, and fragments and derivatives thereof, that 
bind specifically to polypeptides comprising an amino 
acid sequence as provided in SEQ ID NO: 7 - the N- 
terminal portion of PAPP-Ea, PAPP-Eb, and PAPP-Ec that 
is absent from the PAPP-Ef isoform - and the binding of 
which can be competitively inhibited by a polypeptide 
the sequence of which is given in SEQ ID NO: 7. Such 
antibodies can be used to discriminate the novel 
isoforms described herein from the PAPP-Ef isoform, but 
will not be able to discriminate as among PAPP-Ea, -Eb, 
and -Ec isoforms. 

Such antibodies are useful in in vitro 
immunoassays, such as ELISA of maternal serum, western 
blot of maternal serum, or immunohistochemical assay of 
chorionic villus samples, in which the collective 
concentration and/or quantity of PAPP-Ea, -Eb, and -Ec 
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protein isoforms provides diagnostic and/or prognostic 
information on pregnancy status. Such antibodies are 
also useful in isolating and purifying PAPP-E isoforms 
other than PAPP-Ef by immunoprecipitation, 
immunoaffinity chromatography, or magnetic bead- 
mediated purification. 

In another series of embodiments, the 
invention provides antibodies, both polyclonal and 
monoclonal, and fragments and derivatives thereof, that 
bind specifically to polypeptides comprising an ammo 
acid sequence as provided in SEQ ID NO: 12 - the region 
encoded by exon 21, unique to the PAPP-Eb isoform - and 
the binding of which can be competitively inhibited by 
a polypeptide the sequence of which is given in SEQ ID 
NO: 12. In a further series of embodiments, the 
invention provides antibodies, both polyclonal and 
monoclonal, and fragments and derivatives thereof, that 
bind specifically to polypeptides comprising an ammo 
acid sequence as provided in SEQ ID NO: 13 - the region 
C-terminal to exon 21 that is unique to PAPP-Eb isoform 
due to frameshift relative to the reading frame in 
PAPP-Ea, PAPP-Ec, and PAPP-Ef - and the binding of 
which can be competitively inhibited by a polypeptide 
the sequence of which is given in SEQ ID NO: 13. In yet 
another series of embodiments, the invention provides 
antibodies, both polyclonal and monoclonal, and 
fragments and derivatives thereof, that bind 
specifically to polypeptides comprising an amino acid 
sequence as provided in SEQ ID NO: 14 - the region that 
, is uniquely found in PAPP-Eb - and the binding of whrch 
can be competitively inhibited by a polypeptide the 
sequence of which is given in SEQ ID NO: 14. 

All of the antibodies of these latter three 
series of embodiments can be used to discriminate the 
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PAPP-Eb isoform from all other isoforms. Such 
antibodies are useful in in vitro immunoassays, such as 
ELISA of maternal serum, western blot of maternal 
serum, or immunohistochemical assay of chorionic villus 
samples, in which the collective concentration and/or 
quantity of the PAPP-Eb isoform provides diagnostic 
and/or prognostic information on pregnancy status. 
Such antibodies are also useful in isolating and 
purifying the PAPP-Eb isoform by immunoprecipitation, 
immunoaffinity chromatography, or magnetic bead- 
mediated purification. 

In yet a further series of embodiments, the 
invention provides antibodies, both polyclonal and 
monoclonal, and fragments and derivatives thereof, that 
bind specifically to polypeptides comprising an amino 
acid sequence as provided in SEQ ID NO: 18 - a 20 amino 
acid region of PAPP-Ec centered about the deletion of 
exons 2 and 3 - and the binding of which can be 
competitively inhibited by a polypeptide the amino acid 
sequence of which is given in SEQ ID NO: 18 and cannot 
be competitively inhibited by a polypeptide having the 
amino acid sequence of SEQ ID NO:l (the full-length 
PAPP-Ea protein) . 

Such antibodies can be used to discriminate 
the PAPP-Ec isoform from the other known isoforms, and 
are useful in in vitro immunoassays, such as ELISA of 
maternal serum, western blot of maternal serum, or 
immunohistochemical assay of chorionic villus samples, 
in which the concentration and/or quantity of PAPP-Ec 
isoform provides diagnostic and/or prognostic 
information on pregnancy status. Such antibodies are 
also useful in isolating and purifying the PAPP-Ec 
isoform by immunoprecipitation, immunoaffinity 
chromatography, or magnetic bead-mediated purification. 
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In other embodiments, the invention further 
provides the above-described antibodies detectably 
labeled, and in yet other embodiments, provides the 
above-described antibodies attached to a substrate. 

5 PHARMACEUTICAL COMPOSITIONS 

PAPP-E isoforms are important for maintenance 
of pregnancy and maturation of ovarian follicles. 
Thus, compositions comprising nucleic acids and 
proteins of the present invention can be administered 
10 as contraceptive vaccines and antibodies of the present 
invention can be administered for passive immunization, 
and thus reversible contraception. Alternatively, 
proteins of the present invention can be administered 
by nonimmunogenic routes as replacement therapy in 
15 patients with decreased levels of PAPP-E isoforms, thus 
supporting at-risk pregnancies. 

Accordingly, in another aspect, the invention 
provides pharmaceutical compositions comprising the 
nucleic acids, nucleic acid fragments, proteins, 
20 protein fusions, protein fragments, antibodies, 

antibody derivatives, and antibody fragments of the 

present invention. 

Such a composition typically contains from 
about 0.1 to 90% by weight (such as 1 to 20% or 1 to 

25 10%) of a therapeutic agent of the invention in a 

pharmaceutical ly accepted carrier. Solid formulations 
of the compositions for oral administration can contain 
suitable carriers or excipients, such as corn starch, 
gelatin, lactose, acacia, sucrose, microcrystalline 

30 cellulose, kaolin, mannitol, dicalcium phosphate, 

calcium carbonate, sodium chloride, or alginic acid. 
Disintegrators that can be used include, without 
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10 



limitation, microcrystalline cellulose, corn starch, 
sodium starch glycolate, and alginic acid. Tablet 
binders that can be used include acacia, 
methylcellulose, sodium carboxymethylcellulose, 
polyvinylpyrrolidone (Povidone™) , hydroxypropyl 
methylcellulose, sucrose, starch and ethylcellulose . 
Lubricants that can be used include magnesium 
stearates, stearic acid, silicone fluid, talc, waxes, 
oils, and colloidal silica. 

Liquid formulations of the compositions for 
oral administration prepared in water or other aqueous 
vehicl.es can contain various suspending agents such as 
methylcellulose, alginates, tragacanth, pectin, kelgin, 
carrageenan, acacia, polyvinylpyrrolidone, and 
15 polyvinyl alcohol. The liquid formulations can also 
include solutions, emulsions, syrups and elixirs 
containing, together with the active compound(s), 
wetting agents, sweetners, and coloring and flavoring 
agents. Various liquid and powder formulations can be 
20 prepared by conventional methods for inhalation into 
the lungs of the mammal to be treated. 

Injectable formulations of the compositions 
can contain various carriers such as vegetable oils, 
dimethylacetamide, dimethyl formamide, ethyl lactate, 
25 ethyl carbonate, isopropyl myristate, ethanol, polyols 
(glycerol, propylene glycol, liquid polyethylene 
glycol, and the like) . For intravenous injections, 
water soluble versions of the compounds can be 
administered by the drip method, whereby a 
30 pharmaceutical formulation containing the antifungal 
agent and a physiologically acceptable excipient is 
infused. Physiologically acceptable excipients can 
include, for example, 5% dextrose, 0.9% saline, 
Ringer's solution or other suitable excipients. 
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intramuscular preparations, e.g., a sterile formulation 
of a suitable soluble salt form of the compounds, can 
be dissolved and administered in a pharmaceutical 
excipient such as Water-f or-Injection, 0.9% saline, or 
5% glucose solution. A suitable insoluble form of the 
compound can be prepared and administered as a 
suspension in an aqueous base or a pharmaceutical^ 
acceptable oil base, such as an ester of a long chain 
fatty acid (e.g., ethyl oleate) . 

A topical semi-solid ointment formulation 
typically contains a concentration of the active 
ingredient from about 1 to 20%, e.g., 5 to 10%, in a 
carrier such as a pharmaceutical cream base. Various 
formulations for topical use include drops, tinctures, 
lotions, creams, solutions, and ointments containing 
the active ingredient and various supports and 
vehicles. The optimal percentage of the therapeutic 
agent in each pharmaceutical formulation varies 
according to the formulation itself and the therapeutic 
effect desired in the specific pathologies and 
correlated therapeutic regimens . 

Inhalation and transdermal formulations can 

also readily be prepared. 

Pharmaceutical formulation is a well- 
established art, and is further described in Gennaro 
(ed.), Remington: t h q S riP,r. e and Practice of Pharmacy , 
20 th ed., Lippincott, Williams & Wilkins (2000) (ISBN: 
0683306472); and Ansel et a J . , Pharmaceutical Dosage 
For-™* and Dr-im Delive ry Systems, 7 th ed. , Lippincott 
Williams & Wilkins Publishers (1999) (ISBN: 
0683305727), the disclosures of which are incorporated 
herein by reference in their entireties. 

Conventional methods, known to those of 
ordinary skill in the art of medicine, can be used to 
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administer the pharmaceutical f ormulation (s) to the 
patient . 

Typically, the pharmaceutical formulation 
will be administered to the patient by applying to the 
5 skin of the patient a transdermal patch containing the 
pharmaceutical formulation, and leaving the patch in 
contact with the patient's skin (generally for 1 to 5 
hours per patch) . Other transdermal routes of 
administration (e.g., through use of a topically 
10 applied cream, ointment, or the like) can be used by 
applying conventional techniques. The pharmaceutical 
formulation (s) can also be administered via other 
conventional routes (e.g., enteral, subcutaneous, 
intrapulmonary, transmucosal, intraperitoneal, 
15 intrauterine, sublingual, intrathecal, or intramuscular 
routes) by using standard methods. In addition, the 
pharmaceutical formulations can be administered to the 
patient via injectable depot routes of administration 
such as by using 1-, 3-, or 6-month depot injectable or 
20 -biodegradable materials and methods. 

Regardless of the route of administration, 
the therapeutic protein or antibody agent typically is 
administered at a daily dosage of 0.01 mg to 30 mg/kg 
of body weight of the patient (e.g., Img/kg to 5 
25 mg/kg) . The pharmaceutical formulation can be 

administered in multiple doses per day, if desired, to 
achieve the total desired daily dose. 

The effectiveness of the method of treatment 
can be assessed by monitoring the patient for known 
30 signs or symptoms of a disorder. 



TRANSGENIC ANIMALS AND CELLS 
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In another aspect, the invention provides 
transgenic cells and non-human organisms comprising 
human PAPP-E isoform nucleic acids, and transgenic 
cells and non-human organisms with targeted disruption 
5 of the endogenous orthologue of the human PAPP-E gene. 

The cells can be embryonic stem cells or 
somatic cells. The transgenic non-human organisms can 
be chimeric, nonchimeric heterozygotes, and nonchimeric 
homo zygotes. 

10 DIAGNOSTIC METHODS 

The nucleic acids of the present invention 
can be used as nucleic acid probes to assess the levels 
of PAPP-E isoform mRNA in chorionic villus samples, and 
antibodies of the present invention can be used to 
15 assess the expression levels of PAPP-E isoform proteins 
in chorionic villus samples, to diagnose dysgenetic 
pregnancies antenatally. 

EXAMPLE 1 

Identification and Characterization of 
20 cDNAs Encoding Multiple Isoforms of Human PAPP-E 

Predicating our gene discovery efforts on use 
of genome-derived single exon probes and hybridization 
to genome-derived single exon microarrays - an approach 
that we have previously demonstrated will readily 
25 identify novel genes that have proven refractory to 
mRNA-based identification efforts - we identified an 
exon in raw human genomic sequence that is particularly 
expressed in human placenta. 
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Briefly, bioinformatic algorithms were 
applied to human genomic seguence data to identify 
putative exons. Each of the predicted exons was 
amplified from genomic DNA, typically centering the 
putative coding sequence within a larger amplicon that 
included flanking noncoding sequence. These genome- 
derived single exon probes were arrayed on a support 
and expression of the bioinf ormatically predicted exons 
assessed through a series of simultaneous two-color 
hybridizations to the genome-derived single exon 

microarrays . 

The approach and procedures are further 
described in detail in Penn et al., "Mining the Human 
Genome using Microarrays of Open Reading Frames," 
Nature Genetics 26:315-318 (2000); commonly owned and 
copending U.S. patent application nos . 09/774,203, 
filed January 29, 2001 ("Methods and Apparatus for 
Predicting, Confirming, and Displaying Functional 
Information Derived from Genomic Sequence") and 
09/632,366, filed August 3, 2000 ("Methods and 
Apparatus for High-throughput Detection and 
Characterization of Alternatively Spliced Genes"), and 
commonly owned and copending U.S. provisional patent 
application nos. 60/207,456, filed May 26, 2000 ("Human 
Genome-derived Single Exon Nucleic Acid Probes Useful 
for Gene Expression Analysis by Microarray" ) , the 
disclosures of which are incorporated herein by 
reference in their entireties. 

Using a graphical display particularly 
designed to facilitate computerized query of the 
resulting exon-specif ic expression data, as further 
described in commonly owned and copending U.S. patent 
application no. 09/774,203, filed January 29, 2001 
("Methods and Apparatus for Predicting, Confirming, and 
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Displaying Functional Information Derived from Genomic 
Sequence")/ two exons were identified that are 
expressed at high levels in human placenta, but that 
are expressed, if at all, at low levels in human heart, 
brain, adult liver, HeLa cells, lung, fetal liver, 
HBL100 cells, bone marrow, and BT4 7 4 cells; subsequent 
analysis revealed that the two exons belong to the same 
gene. Further details of procedures and results are 
set forth in commonly owned and copending U.S. 
provisional patent application no. 60/207,456, filed 
May 26, 2000 ("Human Genome-derived Single Exon Nucleic 
Acid Probes Useful for Gene Expression Analysis by 

Microarray") . 

Table 1 summarizes the microarray expression 
data obtained using genome-derived single exon probes 
corresponding to exons 1 and 2. Each probe was 
completely sequenced on both strands prior to its use 
on a genome-derived single exon microarray; sequencing 
confirmed the exact chemical structure of each probe. 
An added benefit of sequencing is that it placed us in 
possession of a set of single base-incremented 
fragments of the sequenced nucleic acid, starting from 
the sequencing primer's 3' OH. (Since the single exon 
probes were first obtained by PCR amplification from 
genomic DNA, we were of course additionally in 
possession of an even larger set of single base 
incremented fragments of each of the single exon 
probes, each fragment corresponding to an extension 
product from one of the two amplification primers.) 

Signals and expression ratios are normalized 
values measured and calculated as further described in 
commonly owned and copending U.S. patent application 
no. 09/774,203, filed January 29, 2001 ("Methods and 
Apparatus for Predicting, Confirming, and Displaying 



- 144 - 

Functional Information Derived from Genomic 
Sequence" ), and U.S. provisional patent application nos. 
60/207,456, filed May 26, 2000 ("Human Genome-derived 
Probes Useful for Gene Expression Analysis by 
5 Microarray") . 
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20 As shown in Table 1, significant expression 

of exons 1 and 2 was seen only in placenta. Placenta- 
specific expression was further confirmed by northern 
blot analysis (see below) . 

Marathon-Ready™ placenta cDNA (Clontech 

25 Laboratories, Palo Alto, CA, USA, catalogue no. 7411-1) 
was used as a substrate for standard RACE (rapid 
amplification of cDNA ends) to obtain a cDNA clone that 



spans 6.3 kilobases and appears to contain the entire 
coding region of the gene to which the two exons 
contribute; for reasons described below, we termed this 
cDNA PAPP-Ea. Marathon-Ready™ cDNAs are adaptor- 
ligated double stranded cDNAs suitable for 3' and 5' 
RACE. Chenchik et al. t BioTechniques 21:52 6-532 
(1996); Chenchik et al . , CLONTECHniques X (1) : 5-8 
(January 1995) . RACE techniques are described, inter 
alia, in the Marathon-Ready"* cDNA User Manual (Clontech 
Labs., Palo Alto, CA, USA, March 30, 2000, Part No. 
PT1156-1 (PR03517)), Ausubel et al. (eds.), Short 
Protocols in Molpnilar Biology : A Compendium of 
Methods from Current Protoc ols in Molecular Biology, 4 th 
edition (April 1999), John Wiley & Sons (ISBN: 
047132938X) and Sambrook et al. (eds.), Molecular 
Cloning; A Laboratory Manual (3rd ed.), Cold Spring 
Harbor Laboratory Press (2000) (ISBN: 0879695773), the 
disclosures of which are incorporated herein by 
reference in their entireties. 

The PAPP-Ea cDNA was sequenced on both 
strands using a MegaBace™ sequencer (Molecular 
Dynamics, Inc., Sunnyvale, CA, USA). Sequencing both 
strands provided us with the exact chemical structure 
of the cDNA, which is shown in FIG. 3 and further 
presented in the SEQUENCE LISTING as SEQ ID NO:l, and 
placed us in actual physical possession of the entire 
set of single-base incremented fragments of the 
sequenced clone, starting at the 5' and 3 ' termini. 

A 398 bp fragment of PAPP-Ea cDNA (nt 2314 to 
2711, including part of exon 4, the whole of exon 5, 
and part of exon 6) was prepared by PCR using PAPP-Ea 
cDNA as template; the fragment was thereafter labeled 
by random priming and used to probe a northern blot of 
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12 tissues (leukocyte, lung, placenta, small intestine, 
liver, kidney, spleen, thymus, colon, skeletal muscle, 
heart and brain) . Blot (not shown) confirmed the 
placenta-specific expression pattern. In addition, the 
random priming placed us in possession of a near 
complete set of fragments of the 398 bp PAPP-Ea cDNA 
fragment . 

Further cDNA clones were obtained from the 
same cDNA library using primers designed to capture the 
entire PAPP-Ea ORF; these efforts identified two splice 
variants that we designated PAPP-Eb and PAPP-Ec, 
respectively. 

The PAPP-Eb and PAPP-Ec cDNAs were sequenced 
on both strands using a MegaBace™ sequencer (Molecular 
Dynamics, Inc., Sunnyvale, CA, USA). Sequencing both 
strands provided us with the exact chemical structure 
of the PAPP-Eb cDNA, shown in FIG. 4 and further 
presented in the SEQUENCE LISTING as SEQ ID NO: 8, and 
of the PAPP-Ec cDNA, shown in FIG. 5 and further 
presented in the SEQUENCE LISTING as SEQ ID NO: 10. 
Sequencing further placed us in actual physical 
possession of the entire set of single-base incremented 
fragments of the sequenced clones, starting at the 5' 
and 3 ' termini . 

PAPP-Ea, PAPP-Eb, and PAPP-Ec cDNAs were 
deposited at the American Type Culture Collection on 

, 2001, under accession numbers , 

, and , respectively. 

As shown in FIG. 3, the PAPP-Ea cDNA spans 
6719 nucleotides and contains an open reading frame 
from nucleotide 7 67 through and including nt 6142 
(inclusive of termination codon) , predicting a protein 
of 1791 amino acids with a (posttranslationally 
unmodified) molecular weight of 198.6 kD. The clone 



- 147 - 



appears full length, with the reading frame opening 
with a methionine and terminating with a stop codon 
before a 3' poly-A tail. 

As further shown in FIGS. 4 and 5, 
respectively, splice variants PAPP-Eb and PAPP-Ec are 
5461 and 4158 nt, respectively. Because the two clones 
were obtained using a 5' primer designed to amplify 
only the PAPP-Ea coding region, the clones lack 5' 
untranslated region (5' UT) ; we presume that the 5' UT 
of these two clones, both of which start with the same 
exon as PAPP-Ea, should be identical to that for the 
PAPP-Ea clone. The PAPP-Eb and PAPP-Ec clones encode 
proteins of 1770 (PAPP-Eb) and 1385 (PAPP-Ec) amino 
acids, respectively, with predicted (post- 
translationally unmodified) molecular weights of 196 kD 
and 152 kD, respectively. 

BLAST query of genomic sequence identified 
four BACs, spanning 265 kb, that constitute the minimum 
set of clones encompassing the three cDNA sequences. 
Based upon the known origin of the four BACs (GenBank 
accession numbers AL031734, AC027620, AL139282, and 
AL031290), the PAPP-E gene can be mapped to human 
chromosome lq24.1- lq25.2. 

Comparison of the cDNA and genomic sequences 
identified 23 exons . Exon organization is listed in 
Table 2. 
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Table 2 
-E Exon Structure 
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FIG. 2 schematizes the exon organization of 
the PAPP-Ea, Eb, and Ec clones. 

Insertion of the 85 bp exon 21 uniquely in 
PAPP-Eb leads to a downstream frame shift, shown by 
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10 



15 



shading of exon 21, with earlier termination of 
translation. PAPP-Ec lacks exons 2, 3 and 21 . 

The sequence of the PAPP-Ea cDNA was used as 
a BLAST query into the GenBank nr and dbEst databases. 
The nr database includes all non-redundant GenBank 
coding sequence translations, sequences derived from 
the 3-dimensional structures in the Brookhaven Protein 
Data Bank (PDB) , sequences from SwissProt, sequences 
from the protein information resource (PIR) , and 
sequences from protein research foundation (PRF) . The 
dbEst (database of expressed sequence tags) includes 
ESTs, short, single pass read cDNA (mRNA) sequences, 
and cDNA sequences from differential display 
experiments and RACE experiments. 

BLAST search identified multiple human ESTs, 
mainly from placental sources, one EST from mouse 
(AI157031), one from rat (AW916144) , and one from cow 
(AW660476) as having sequence closely related to 
PAPP-Ea. BLAST search also identified as closely 
related a newly described human gene, termed PAPP-E, 
further described in Farr et al . , Biochim. Biophys. 
Acta 1493:356-362 (October 2000), and human pregnancy- 
associated protein-A (PAPP-A) . 

Because the PAPP-E clone described by Farr et 
25 al. appears to be either another isoform, or an 

incomplete clone, of the gene that we have identified, 
we have named our gene PAPP-E and termed the Farr et 
al. clone PAPP-Ef ("f" in deference to the authors). 

As shown in FIG. 2, the PAPP-Ef cDNA includes 
only a portion of exon 1, and is thereafter identical 
(with a single nucleotide change) to PAPP-Ea, including 
all of exons 2 - 20, 22 and 23. The exact start of the 
PAPP-Ef and PAPP-Ef translations are shown on each of 



20 



30 
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FIGS. 3, 4, and 5. We detect only a single nucleotide 
difference between PAPP-Ea and PAPP-Ef in the region in 
which they are coextensive. 

Globally, PAPP-Ea resembles human (44% amino 
acid identity and 63% amino acid similarity) and mouse 
(46% amino acid identity and 63% amino acid similarity) 
PAPP-A protein. 

Motif searches using Pfam 
(http://pfam.wustl.edu), SMART (http://smart.embl- 
heidelberg.de), and PROSITE pattern and profile 
databases (http://www.expasy.ch/prosite) , identified 
several known domains shared with PAPP-A. 

FIG. 1 shows the domain structure of PAPP-A 
and all known isoforms of the PAPP-E protein. 

As schematized in FIG. 1, our newly isolated 
isoforms - PAPP-Ea, PAPP-Eb, and PAPP-Ec - share 
certain protein domains and an overall structural 
organization with PAPP-A; in conjunction with a pattern 
of expression strikingly similar to that of PAPP-A, 
with high level expression in placenta, the shared 
structural features strongly imply that the three PAPP- 
E isoforms play a similar role in regulating the 
activity of a plasma borne growth factor(s), possibly 
IGF, which in turn is important for maintenance of 
pregnancy and/or normal fetal development, making the 
PAPP-E isoforms clinically useful diagnostic markers 
and potential therapeutic agents. 

Like PAPP-A, all three novel isoforms have 
the zinc-binding domain ("zinc") characteristic of 
metzincin superfamily metalloproteases , defined by the 
degenerate motif "HEXXHXXGXXH" , where invariant 
residues are shown underlined and variable residues are 
shown as "X". In PAPP-Ea, the longest isoform, the 
zinc binding domain occurs at residues 733 - 743 with 
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sequence HE VGHVLGL YH ; the sequence is underlined in 
FIG. 3. 

In common with PAPP-A, all three novel 
isoforms of PAPP-E have an at least four-fold 
repetition near the C-terminus of the short consensus 
repeat ("SCR"; alternatively denominated "sushi" 
domain) (residues 1396-1459, 1464-1521, 1525-1590, and 
1595-1646, numbered as in PAPP-Ea) . Relaxation of 
certain bioinf ormatic parameters suggests the presence 
of a fifth SCR domain. 

In common with PAPP-A, all three novel 
isoforms of PAPP-E also have at least one "NL" (notch- 
lin, also termed lin notch repeat, or "LNR") domain, 
so-called due to its presence in Notch and Lin-12 
proteins, both of which proteins regulate early tissue 
differentiation. As shown in FIG. 1, PAPP-Ea possesses 
three NL domains in the same general spaced 
relationship to the zinc domain as is found in PAPP-A. 
PAPP-Eb, in contrast, lacks the C-terminal NL domain, 
whereas PAPPE-c, the shortest of the novel isoforms, 
lacks the two NL domains on the N-terminal side of the 
zinc-binding domain. 

The four-fold repetition of SCR ("sushi") 
domains is characteristic of complement proteins and 
selectins. Five-fold repetition of SCR domains with 
further presence of at least one NL domain has been 
previously identified in complement decay-accelerating 
factor and P-selectin. 

In contrast to PAPP-A, two of the novel 
isoforms of PAPP-E - PAPP-Ea and PAPP-Eb - have a 
laminin G domain. Laminin G domains are found in a 
number of extracellular and receptor proteins, and are 
implicated in interactions with cellular receptors 
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(integrins, alpha-dystroglycan) , sulfated carbohydrates 
and other extracellular ligands. 

In contrast to PAPP-A, all three novel 
isoforms of PAPP-E contain nuclear localization signals 
5 ( "NLS" ) ; with concurrent presence of a leader sequence 
(not shown), these signals suggest that all three 
PAPP-E isoforms can be secreted and also localize to 
the cell nucleus. 

Possession of the genomic sequence permitted 
10 search for promoter and other control sequences for the 

hPAPP-E gene. 

A putative transcriptional control region, 
inclusive of promoter and downstream elements, was 
defined as 1 kb around the transcription start site, 
15 itself defined as the first nucleotide of the PAPP-Ea 
cDNA clone. The region, drawn from sequence of BAC 
AL031734.9, has the following sequence, where 
nucleotide number 1001 is the transcription start site: 



40 
80 
120 



tcttccccatcctttccatccatttcaaatcaattggaaa 
20 catggttccttgggtctagctgttcatttttgtaaattac 
ttattttgaacatctcattgtttatttgctcactcagcat 
atggtgacttttagtaacttcagattgagaaacttctgag 1 60 
ataaaaaggagacctatgtagtatgaattcatggcatttc 200 
catttagtacttctcacagcaggatacttgatttctcctt 240 

25 tctcccatgtccgatttaaagtgaatttaagatattgttc 280 

ttttaaatccccaatgattgaacaaagtaagaaaaaatac 320 
tttgttttgtttgtgacaaaacaaaagaaaaatacaaggg 
atccctaaaaggttagtgtgggcttattaggcagtaggta 
gatctgttcacagtaagtgtgtgtgtgtgtgtgtgtgtgt 

30 gtgtgagagagagagagagagagagagggagaatacacac 480 

agagaagagtactccaaaacactattgattttttgctatt 520 
gattgtgtaggctgcggctgctgaaagagaaagcccgaga 560 
tgtttactggggaaaccaagagtagcgtctgtcccctgtg 600 



360 
400 
440 
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ccttggtgaggtgggtaggttttcaggaggaaggagggga 640 

cagggaggagtaggtggagt gat gcattgaact tact age 68 0 

tttgacatcatcattgtctttaaatgaaaacaaaaacaaa 720 

aacaaaaacaaaaaacaagaagatatttacaggcagacag 7 60 

aaagggagccaaggggagcaggagagactggagagaacag 800 

gtcccctgaagtgtatgctcttctttttgctcttttcccg 840 

atcttcccaggaacccacaagactcccagaaggtgaagtt 8 80 

aagagctcccagactcataaggttattagaacagcaaact 920 

ggcaccccaaagaactttacggagacttgcaacctatcaa 960 

caagttggatgagggattaaaagccttcaacaaccaacaa 1000 
[SEQ ID NO: 65] . 

Using PROSCAN, 
(http : //bimas . dcrt . nih . gov/molbio/proscan/ ) , 
no significant promoter was identified in the putative 
promoter region. However, transcription factor binding 
sites were identified using a web site at 
http://motif.genome.ad.jp/, including a group of SRY 
(sex-determining region Y gene product) binding sites 
(726.. 732; 714.. 720; 339.. 345 bp, with numbering 
according to SEQ ID NO: 65). 

We have thus identified three novel 
isoforms of a newly described human gene, PAPP-E, which 
share certain protein domains and an overall structural 
organization with PAPP-A; in conjunction with a pattern 
of expression strikingly similar to that of PAPP-A, 
with high level expression in placenta, the shared 
structural features strongly imply that the three 
PAPP-E isoforms play a role similar to PAPP-A, 
regulating the activity of a plasma-borne growth 
factor (s), possibly IGF, which in turn is important for 
maintenance of pregnancy and/or normal fetal 
development, making the PAPP-E isoforms clinically 
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useful diagnostic markers and potential therapeutic 
agents . 



EXAMPLE 2 
Preparation and Labeling of 
5 Useful Fragments of Human PAPP-E Isoforms 



Useful fragments of PAPP-Ea, PAPP-Eb, and 
PAPP-Ec clones are produced by PCR, using standard 
techniques, or solid phase chemical synthesis using an 
automated nucleic acid synthesizer. Each fragment is 
10 sequenced, confirming the exact chemical structure 
thereof. 

The exact chemical structure of preferred 
fragments is provided in the attached SEQUENCE LISTING, 
the disclosure of which is incorporated herein by 
15 reference in its entirety. The following summary 
identifies the fragments whose structures are more 
fully described in the SEQUENCE LISTING: 





SEQ 


ID 


NO: 


1 


nt 


full length PAPP-Ea cDNA 


20 


SEQ 


ID 


NO: 


2 


nt 


coding region of PAPP-Ea 




SEQ 


ID 


NO: 


3 


aa 


full length coding seguence of 
PAPP-Ea 




SEQ 


ID 


NO: 


4 


nt 


5' sequence absent from PAPP- 
Ef clone 


25 


SEQ 


ID 


NO: 


5 


nt 


5* UT absent from PAPP-Ef 
clone (nt 1 - 766) 




SEQ 


ID 


NO: 


6 


nt 


N-terminal coding region 
absent from PAPP-Ef clone 




SEQ 


ID 


NO: 


7 


aa 


coding region of PAPPE-Ea from 



30 aa 1 - 19 [N terminal coding 

region absent from PAPP-Ef 
clone] 
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SEQ 
SEQ 
SEQ 

5 SEQ 
SEQ 
SEQ 

10 

SEQ 



SEQ 

15 SEQ 
SEQ 
SEQ 

20 

SEQ 
SEQ 
SEQ 
SEQ 

25 SEQ 
SEQ 
SEQ 
SEQ 
SEQ 

30 SEQ 
SEQ 
SEQ 
SEQ 
SEQ 



ID NO: 8 nt 

ID NO: 9 nt 

ID NO: 10 aa 

ID NO: 11 nt 

ID NO: 12 aa 

ID NO: 13 aa 

ID NO: 14 aa 

ID NO: 15 nt 

ID NO: 16 aa 

ID NO: 17 nt 

ID NO: 18 aa 

ID NO: 19 nt 

ID NO: 20 nt 

ID NO: 21 nt 

ID NO: 22 nt 

ID NO: 23 nt 

ID NO: 24 nt 

ID NO: 25 nt 

ID NO: 26 nt 

ID NO: 27 nt 

ID NO:28 nt 

ID NO:29 nt 

ID NO: 30 nt 

ID NO: 31 nt 

ID NO: 32 nt 



full length PAPP-Eb cDNA 
coding region of PAPP-Eb cDNA 
full length coding sequence of 
PAPP-Eb (aa 1 - 1770) 
exon novel in PAPP-Eb (nt 
5203-5287) 

CDS entirely within novel exon 
(aa 1735 - 1762) 
CDS due to frame shift (aa 
1763 - 1770) 

CDS novel within PappE-b (aa 
1735 - 1770) (inclusive of old 
nt) 

coding region of PAPP-Ec cDNA 
full length coding sequence of 
PAPP-Ec 

nt 892 - 951 (around splice 
junction) 

20 amino acids centered about 

deletion: aa 298 - 317 

exon 1 (from genomic sequence) 

exon 2 

exon 3 

exon 4 

exon 5 

exon 6 

exon 7 

exon 8 

exon 9 

exon 10 

exon 11 

exon 12 

exon 13 

exon 14 
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O-L-i V 


m 


MO' 33 


nt 


pxnn 15 








TD 

X J-/ 


NO • 14 

IN W • J *1 


nt 


pYnn 1 6 






SEQ 


ID 


NO: 35 


nt 


exon 17 






QTTO 
DJCjV 


t n 


In U « -3D 


n t 


t; AUii x o 




5 


SEQ 


ID 


NO: 37 


nt 


exon 19 






oiiiy 




JNU . JO 


11 U 


q v r~\ O O 








t n 


MO • 1 Q 
In U • Oi? 


11 U 


eJvUll <£. X 








t n 

X JJ 


JNU . *± U 


ill 


exon 






SEQ 


ID 


NO: 41 


nt 


exon 2 3 




1 u 




t n 
ID 


MO • /I O 
JNU . H Z 


UL 


OUU up yciiOIUlu 

centered about 


cLilip X 1UU11 

exon 1 






1 V 


MO • A "3 

JNU . 4 o 


nt 


ouu op ycuoiuiu 
centered about 


CLilip J_ X L.U11 

exon 2 




C TO 




MO • 4 4 
JNU • H fi 


T-i t" 

I1L 


>J\J\J Up y cl lOlLl _L ^ 


dlllkV X X uull 


15 










centered about 


exon 3 






X JJ 


MO • 4^ 
IN U • H D 


n l 


*J\J\J Up L) cUUiUlO 

r*p*n t n e^ri r^bont 

UCU LCI cu auuuu 


=j rr\TTi 1 "i pnn 

exon 4 






Tn 
X JJ 


MO • 4 

1N\^ • *4 U 


nt 


S00 nn cre^n om i c 

centered about 


amr) licon 

exon 5 


9 n 
z u 




t n 

X JJ 


MO • 4 7 


n t 


SOD hn rro-n otti i r* 

centered about 


amn 1 iron 
exon 6 






TD 
J. u 


MO • 4 ft 


nt 


ROD hn rrpnoTTi i o 

j y ciiviuj.v^ 

centered about 


amn 1 icon 

exon 7 






J- LJ 


MO « 4 Q 


nt 

11 L. 


500 hn apnomir 

>j \j \j i^J V-s y v3- 1 1 WILL _1_ 


amol icon 


25 










centered about 


exon 8 




SEQ 


ID 


NO: 50 


nt 


500 bp genomic 
centered about 


amp licon 
exon 9 




SEQ 


ID 


NO: 51 


nt 


500 bp genomic 
centered about 


amplicon 
exon 10 


30 


SEQ 


ID 


NO: 52 


nt 


500 bp genomic 
centered about 


amplicon 
exon 11 




SEQ 


ID 


NO: 53 


nt 


500 bp genomic 
centered about 


amplicon 
exon 12 
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SEQ ID NO: 54 nt 500 bp genomic amplicon 

centered about exon 13 
SEQ ID NO: 55 nt 500 bp genomic amplicon 

centered about exon 14 
SEQ ID NO: 56 nt 500 bp genomic amplicon 

centered about exon 15 
SEQ ID NO: 57 nt 500 bp genomic amplicon 

centered about exon 16 
SEQ ID NO: 58 nt 500 bp genomic amplicon 

centered about exon 17 
SEQ ID NO: 59 nt 500 bp genomic amplicon 

centered about exon 18 
SEQ ID NO: 60 nt 500 bp genomic amplicon 

centered about exon 19 
SEQ ID NO: 61 nt 500 bp genomic amplicon 

centered about exon 20 
SEQ ID NO: 62 nt 500 bp genomic amplicon 

centered about exon 21 
SEQ ID NO: 63 nt 500 bp genomic amplicon 

centered about exon 22 
SEQ ID NO: 64 nt 500 bp genomic amplicon 

centered about exon 23 
SEQ ID NO: 65 nt 1000 bp putative promoter 

SEQ ID NOs: 66 - 888 

nt 17-mers scanning nt 1 - 823 of 
PAPP-Ea 
SEQ ID NOs: 889 - 1711 

nt 25-mers scanning nt 1 - 823 of 
PAPP-Ea 
SEQ ID NOs: 1712 - 1796 

nt 17-mers scanning SEQ ID NO: 11 
SEQ ID NOs: 1787 - 1881 

nt 25-mers scanning SEQ ID NO: 25 
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Upon confirmation of the exact structure, 
each of the above-described nucleic acids of confirmed 
structure is recognized to be immediately useful as a 
PAPP-E-specif ic probe. 

For use as labeled nucleic acid probes, the 
above-described PAPP-E nucleic acids are separately 
labeled by random priming. As is well known in the art 
of molecular biology, random priming places the 
investigator in possession of a near-complete set of 
labeled fragments of the template of varying length and 
varying starting nucleotide. 

The labeled probes are used to identify the 
PAPP-E gene on a Southern blot, and are used to measure 
expression of PAPP-E isoforms on a northern blot and by 
RT-PCR, using standard techniques. 

EXAMPLE 3 
Production of PAPP-E Protein 

In parallel, each of the full length PAPP-Ea, 
PAPP-Eb, and PAPP-Ec cDNA clones is separately cloned 
into the mammalian expression vector pcDNA3 . 1/HISA 
(invitrogen, Carlsbad, CA, USA), transfected into COS7 
cells, transfectants selected with G418, and protein 
expression in transfectants confirmed by detection of 
the anti-Xpress™ epitope according to manufacturer's 
instructions. Protein is purified using immobilized 
metal affinity chromatography and vector-encoded 
protein sequence is then removed with enterokinase, per 
manufacturer's instructions, followed by gel filtration 

and/or HPLC. 

Following epitope tag removal, each of PAPP- 
Ea, PAPP-Eb, and PAPP-Ec proteins is present at a 
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concentration of at least 70%, measured on a weight 
basis with respect to total protein, and is free of 
acrylamide monomers, bis acrylamide monomers, 
polyacrylamide and ampholytes. Further HPLC 
purification provides PAPP-Ea, PAPP-Eb, and PAPP-Ec 
proteins at concentrations, respectively, of at least 
95%, measured on a weight basis with respect to total 
protein. 

EXAMPLE 4 
Production of Anti-PAPP-E Antibody 

Purified proteins prepared as in Example 3 
are conjugated to carrier proteins and used to prepare 
murine monoclonal antibodies by standard techniques. 
Initial screening with the unconjugated purified 
proteins, followed by competitive inhibition screening 
using peptide fragments of the PAPP-E isoforms, 
identifies monoclonal antibodies with specificities in 
each of the following categories: antibodies that 
recognize all PAPP-E isoforms, antibodies that 
recognize PAPP-Eb alone, and antibodies that recognize 
PAPP-Ec alone. 

EXAMPLE 5 

Use of hPAPP-E Probes and Antibodies 
for Antenatal Diagnosis of Aneuploidy 

After informed consent is obtained, 
peripheral blood samples are drawn from pregnant women 
at 14 weeks gestation and tested for PAPP-A levels by 
standard techniques and tested additionally for PAPP-E 
levels using anti-PAPP-E antibodies in a standard 
ELISA. 
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After pregnancy outcome is fully determined 
for all patients, tabulated results demonstrate a 
statistically significant decrease in the circulating 
maternal level of PAPP-E correlated with adverse 
5 outcome, and specifically with presence of either 

trisomy 18 or trisomy 21. Results further demonstrate 
that determination of concurrent PAPP-A and PAPP-E 
levels provides greater predictive value than either 
marker alone. 

10 In a second series of experiments, chorionic 

villus samples obtained for purpose of antenatal fetal 
karyotyping are further examined for expression of 
PAPP-E isoforms. 

In a first series of tests, total RNA is 

15 separately extracted from each CVS using a commercial 
kit and poly-A + RNA isolated with a commercial kit, A 
northern blot is prepared and probed with a 
radiolabeled PAPP-E nucleic acid probe. Specific 
hybridization is quantitated using a Phosphorlmager 

20 (Molecular Dynamics, Inc., Sunnyvale, CA, USA). RNA 

levels are normalized to expression levels of p-actin. 

After pregnancy outcome is fully determined 
for all patients, tabulated results demonstrate that a 
statistically significant decrease in chorionic villus 

25 PAPP-E expression is correlated with adverse outcome, 
and specifically with presence of either trisomy 18 or 
trisomy 21. Results further demonstrate that 
determination of maternal serum PAPP-E levels in 
conjunction with fetal chorionic villus expression 

30 provides greater predictive value than either marker 
alone . 

In a second series of tests, chorionic villus 
samples are prepared for immunohistochemical analysis 
by standard techniques. Anti-PAPP-E antibodies labeled 
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with alkaline phosphatase are used to visualize PAPP-E 
protein in the CVS samples by enhanced 
chemi luminescence . 

After pregnancy outcome is fully determined 
5 for all patients, tabulated results demonstrate a 

statistically significant decrease in chorionic villus 
PAPP-E protein expression is correlated with adverse 
outcome, and specifically with presence of either 
trisomy 18 or trisomy 21. Results further demonstrate 
10 that determination of maternal serum PAPP-E levels in 
conjunction with fetal chorionic villus expression 
provides greater predictive value than either marker 
alone. 



All patents, patent publications, and other 
15 published references mentioned herein are hereby 

incorporated by reference in their entireties as if 
each had been individually and specifically 
incorporated by reference herein. While preferred 
illustrative embodiments of the present invention are 
20 described, one skilled in the art will appreciate that 
the present invention can be practiced by other than 
the described embodiments, which are presented for 
purposes of illustration only and not by way of 
limitation. The present invention is limited only by 
25 the claims that follow. 



